Commit Graph

45 Commits

Author SHA1 Message Date
William Jeynes f821e9643d Add url validity metrics 2026-04-04 20:02:25 +01:00
William Jeynes 8c0921057b start on work to calculate % if valid URLS 2026-04-04 18:52:47 +01:00
William Jeynes f8d4155b7c Add more robust parsing of LLM JSON output 2026-03-27 11:09:59 +00:00
William Jeynes 5e374a8bd6 Fix errors seen during longer runs: selenium exceptions, insecure certificates, recusrsion limit exceeded, BM25 document corpus too small 2026-03-26 12:22:13 +00:00
William Jeynes fbc688b8f9 add date to returned data 2026-03-25 22:37:14 +00:00
William Jeynes a7f5978f64 Update documentation. Stop storing context. Decide on final claims source 2026-03-25 14:24:55 +00:00
William Jeynes 8f939d54c4 Implement ensemble into final model structure 2026-03-24 19:07:24 +00:00
William Jeynes 624d45bc53 Re-allow multithreading on service. Add results table 2026-03-24 18:29:40 +00:00
William Jeynes 80bc151379 add majority voting 2026-03-24 16:50:41 +00:00
William Jeynes 5ce64290ce Make an ensemble model to combine scores together (very high accuracy) 2026-03-24 15:50:41 +00:00
William Jeynes c69730df6b Refine scoring to allow for better iteration on frontend. Update generate_adversarial.py 2026-03-22 16:04:38 +00:00
William Jeynes f4e84af272 Make the model less overfitting. Make it harder for an event to be classed as "perfect" 2026-03-18 01:05:24 +00:00
William Jeynes 8052d5c7ba Working on making the classifier harsher on unseen data 2026-03-17 22:19:03 +00:00
William Jeynes c89c7054fe Update agent to support new verification style. Update frontend to support new file format and remove redundant logic from old experiments. 2026-03-16 17:16:58 +00:00
William Jeynes 0a7bb114d2 Add removing of duplicates from pipeline. Add to sort step. Move score logic to robertaMetrics node. 2026-03-13 14:51:14 +00:00
William Jeynes 8311556855 Add ROBERTA classifier ranking PoC, with 77pc off the bat 2026-03-13 11:24:51 +00:00
William Jeynes ef6330ec07 Add re-ranker mode to support re-ranking experiments, hopefully we can reduce the loss 2026-03-06 17:27:09 +00:00
William Jeynes 6ae551a93f Ensure date is passed to pipeline. Fix woring 2026-03-02 14:58:26 +00:00
William Jeynes 8317fd85df Add file logging for errors. Add exponential backoff retry to web search. On failed web search, do not crash pipeline, return placeholder text to loanguage model 2026-02-24 13:05:35 +00:00
William Jeynes 4d92f14527 Getting hits on the block list IMMEDIATLEY. Log to file, might be important later 2026-02-22 15:42:27 +00:00
William Jeynes 2f33338007 Do not enter existing data if it has no good trigger events 2026-02-22 15:29:48 +00:00
William Jeynes d1ab938c0b Add filtering from known disinformation sources 2026-02-22 15:14:58 +00:00
William Jeynes 8ffe8dec82 Use cleaned trigger events in input.jsonl 2026-02-19 12:23:38 +00:00
William Jeynes 5efce05821 Update REAME to include description of data files 2026-02-19 11:43:25 +00:00
William Jeynes 78a49e2843 Start writing cleaned jsonl output. Re-add sentence to trigger prompt. Fix recursion limit 2026-02-19 11:36:31 +00:00
William Jeynes 6f20ade780 Make open webpage more appealing 2026-02-19 10:07:39 +00:00
William Jeynes a2cb93b44e Start refining scorer. Filter data passed to trigger event agent 2026-02-18 15:03:13 +00:00
William Jeynes 3f14b61cd4 Move all data to own folder. Add run shell script. Experiment (unsuccessfully so far) with example retreival 2026-02-16 22:42:13 +00:00
William Jeynes 90894b2c10 Add some preliminary analysis 2026-02-16 14:42:47 +00:00
William Jeynes 6d478fe7ec Add multi claim runner. Add dbkf fetcher for automated testing. Add visualisation tool plus human score enterer. 2026-02-13 15:15:01 +00:00
William Jeynes fa6e7017b0 Add run scripts 2026-02-12 23:52:15 +00:00
William Jeynes 7fe63d6a98 Refactor calculating score. Add sort node for vanity 2026-02-12 23:46:00 +00:00
William Jeynes b06c08daab Add relation model. Add calculate score initial version 2026-02-12 23:26:59 +00:00
William Jeynes c89f73e138 Implement RAGAS metrics 2026-02-12 22:52:22 +00:00
William Jeynes 6dd6bf7eaf implement verification model 2026-02-12 22:32:24 +00:00
William Jeynes bef856d53a Refactor example retreiving, add option for dynamic data. Add hybrid reranking to tooling. Add parsing and loop infrastructure for trigger event processing 2026-02-12 14:33:12 +00:00
William Jeynes eba5eb40a2 Add RAGAS initial version 2026-02-09 21:26:54 +00:00
William Jeynes cd2c8621e8 FEAT: implement temp version of main tooling feedback loop 2026-02-09 20:25:36 +00:00
William Jeynes 5841e8a922 add search query 2026-02-09 16:45:17 +00:00
William Jeynes 02eac0f553 Allow multiple source CSV files for normalisation. Implement real model node. Add normalizarion prompt. Implement normalization setup. Start on RAG retreival functions 2026-02-09 16:32:40 +00:00
William Jeynes 8eaa7bfbff Add initial code for retreival ranking for normalisation 2026-01-29 21:53:38 +00:00
William Jeynes a1373da891 create final nodes 2026-01-28 22:03:21 +00:00
William Jeynes c6416622e4 start adding dummy nodes 2026-01-28 21:26:34 +00:00
William Jeynes a3201d17a2 add initial testing 2026-01-27 22:57:49 +00:00
William Jeynes fdf8be2414 Repository Structure 2026-01-27 21:09:33 +00:00