CitizensFoundation / pace-keyword-scanner
CommonCrawl keyword scanner. Time for month of CC data on EC2 c5.18xlarge instance for hundreds of keywords takes about 3 hours. LLM (BERT) based 2nd level filtering. Developed with support from the EU and the Populism & Civic Engagement H2020 project.
☆13Updated last year
Related projects: ⓘ
- Track changes to GraphQL APIs by git scraping their schemas☆22Updated this week
- Datasette showing global power plant data from https://github.com/wri/global-power-plant-database☆16Updated 2 years ago
- Dead simple cron service for making HTTP calls on a regular schedule.☆14Updated 4 years ago
- A demonstration transnational register of beneficial ownership data from the UK, Denmark, Slovakia and Armenia☆18Updated 2 months ago
- GitHub statistics☆11Updated 2 years ago
- Matrix-based News Aggregation to Explore Media Bias☆19Updated 6 years ago
- A visualisation library for beneficial ownership structures