fcd0c435d3ee725f59eb958d2a39f72412781270
AI models for identifying trigger events in disinformation analysis
Final Dissertation Submission Repository - Experiments with created dataset
Links
Project Description: https://github.com/WillJeynes/LLMsForDisinformationAnalysis/
Get the dataset: https://huggingface.co/datasets/WillJeynes/LLMsForDisinformationAnalysis-Dataset
This Repository:
Finetuned Model
Tinetuning a LLM to better predict possible disinformation claims arising from world event
Kind of the opposite of dataset
Stats available here
Final LoRa version available here: https://huggingface.co/WillJeynes/LLMsForDisinformationPrediction
Graph Viz
A way to visualise the connections between claims and trigger events
Visible here: https://jillweynes.github.io/LLMsForDisinformationPrediction-GraphVizBuilt/
Repository Structure
├── query_model.py # call final finetuned LLM from hugging face
├── finemodel/
| ├── eval*.py # Call APIs
| ├── lora*.py, full.py # Train models against dataset
| └── q_*.py # Expose trained models as API
├── graphviz/
| ├── frontend/ # React + Parcel + react-force-graph frontend to visualise results
| └── processing/ # Python scripts to generate clusters and titles
└── data/ # Holder from project data
├── dataset.jsonl # Collated dataset - in full format
└── dataset.csv # Collated dataset - in CSV format
Description
Languages
Python
66%
TypeScript
33.5%
HTML
0.5%