Add new accuracy results

William Jeynes
2026-04-05 11:50:53 +01:00
parent 42cf4da794
commit d21a8b537e
+7 -4
@@ -14,10 +14,13 @@ Experiments modifying pipeline
 Experiments with different model types:
 | Model | % Correct | % Change |
 |-------------------------------|----------:|---------:|
-| gpt-5-mini | 33 | 0 |
-| gpt-5.4-mini | 32.4 | -0.02 |
-| llama3.1:8b-instruct-q4_K_M | ? | ? |
-| qwen3.5:9b | 0 | -100 |
+| gpt-5-mini | 45.51 | |
+| gpt-5.4-mini | 32.4 | |
+| gpt-5.4-nano | 23.28 | |
+| gpt-4.1-mini | 27.85 | |
+| gpt-4o-mini | 32.47 | |
+| llama3.1:8b-instruct-q4_K_M | ? | |
+| qwen3.5:9b | 0 | |
 %age valid URLS
 | Model | Number | % Age |