William Jeynes cf923d6e87 Add new accuracy results

2026-04-05 11:51:28 +01:00

Refining the agent output

Experiments modifying the pipeline:

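| Pipeline variant | % Correct | Relative change vs. baseline (fraction) |
| --- | --- | --- |
| Baseline | 33 | 0 |
| Improved Prompt | 39.96 | 0.21 |
| Add Examples | 44.67 | 0.35 |
| Date | 45.51 | 0.38 |
| Chain of Thought | 43.38 | 0.31 |
| Self-Critique | 44.36 | 0.34 |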
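
The last column of the table above appears to be the relative change in accuracy against the 33% baseline, expressed as a fraction rather than a percentage (for example, 0.21 means roughly a 21% relative improvement). The following is a minimal sketch of that arithmetic, not code from this repository; the function name `relative_change` and the `results` dictionary are assumptions made for illustration, with the scores copied from the table.

```python
def relative_change(score: float, baseline: float) -> float:
    """Return the relative change of `score` against `baseline` as a fraction."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (score - baseline) / baseline


# Scores (% correct) copied from the table; the names are illustrative only.
BASELINE = 33.0
results = {
    "Improv Prompt": 39.96,
    "Add Examples": 44.67,
    "Date": 45.51,
    "Chain of Thought": 43.38,
    "Self-Critique": 44.36,
}

# Round to two decimals to match the precision used in the table.
changes = {
    name: round(relative_change(score, BASELINE), 2)
    for name, score in results.items()
}

for name, change in changes.items():
    print(f"{name}: {change:.2f}")
```

Multiplying each fraction by 100 gives the change as a percentage, which may be easier to compare with the "% Correct" column.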

Experiments with different model types:

| Model | % Correct | % Change |
| --- | --- | --- |
| gpt-5-mini | 45.51 | |
| gpt-5.4-mini | 32.4 | |
| gpt-5.4-nano | 23.28 | |
| gpt-4.1-mini | 27.85 | |
| gpt-4o-mini | 32.47 | |
| llama3.1:8b-instruct-q4_K_M | ? | |
| qwen3.5:9b | 0 | |

Percentage of valid URLs:

| Model | Valid / total URLs | % Valid |
| --- | --- | --- |
| gpt-5-mini | 22/405 | 5.43 |
| gpt-5.4-mini | 29/278 | 10.43 |
| llama3.1:8b-instruct-q4_K_M | ? | ? |
| qwen3.5:9b | 0 | 0 |
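
The percentages in the table above are consistent with dividing the number of valid URLs by the total number of URLs and rounding to two decimals. The sketch below reproduces only that arithmetic; it makes no assumption about how a URL is judged valid, which this README does not specify. The helper name `percent_valid` is hypothetical.

```python
def percent_valid(valid: int, total: int) -> float:
    """Return valid/total as a percentage rounded to two decimals."""
    if total == 0:
        # No URLs were produced, so report 0 instead of dividing by zero.
        return 0.0
    return round(100 * valid / total, 2)


# Counts copied from the table.
gpt_5_mini = percent_valid(22, 405)
gpt_5_4_mini = percent_valid(29, 278)

print(gpt_5_mini, gpt_5_4_mini)
```

Returning 0.0 for an empty total is a design choice made for this sketch so that a model producing no URLs at all still gets a row in the table.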