Update documentation. Stop storing context. Decide on final claims source

2026-03-25 14:24:55 +00:00
parent 872346c657
commit a7f5978f64
6 changed files with 34 additions and 8 deletions
@@ -1,3 +1,15 @@
+# Classifier work for evaluating model quality
+
+Made using a dataset of 1000 labeled claims from MVP pipeline.
+
+Roberta model trained on an augmented dataset with LLM generated adversarial examples for low frequency labels.
+
+Flan model trained using raw labelled claims, inherrent natural language ability allows for pattern recognition without the need for fake data.
+
+Regression model trained using the roberta dataset.
+
+Used ensemble model in the final version, with the component models available on Hugging Face. 
+
 | Model                                                      | % Correct | % Valid taken forward|Used in ensemble|Link
 |------------------------------------------------------------|-----------|----------------------|----------------|-
 | Original                                                   | 53.22     | 61.72                |