CitizensFoundation / pace-keyword-scanner
CommonCrawl keyword scanner. Time for month of CC data on EC2 c5.18xlarge instance for hundreds of keywords takes about 3 hours. LLM (BERT) based 2nd level filtering. Developed with support from the EU and the Populism & Civic Engagement H2020 project.
☆14Updated 2 years ago
Alternatives and similar repositories for pace-keyword-scanner:
Users that are interested in pace-keyword-scanner are comparing it to the libraries listed below
- Real-Time Proxy & Web Scraping API☆24Updated 5 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 6 years ago
- Datasette showing global power plant data from https://github.com/wri/global-power-plant-database☆17Updated 2 years ago
- List of privacy-friendly analytics solutions☆19Updated last year
- A Google Trends Analytics Package☆13Updated 9 months ago
- ☆12Updated last year
- Helps you to visualize the site structure☆9Updated last year
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆16Updated last year
- Citadel: Enterprise Search☆13Updated last year
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 3 years ago
- A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.☆19Updated 8 months ago
- Matomo plugin for Docusaurus v2/v3☆14Updated last year
- Fully customizable open source voice experience that can be hosted on any website.☆33Updated 2 years ago
- Ontology dataset for open_numbers namespace☆10Updated 4 months ago
- Active Citizen is an open source library, API and UI using various AI technologies aimed at empowering citizens democratically.☆44Updated 3 weeks ago
- Everyting you need to know about Aquila Network Neural Search Ecosystem. Official repositories, client libraries, ecosystem projects, boi…☆32Updated 3 years ago
- Penme is a lightweight open source note taking app focused on privacy!☆26Updated 4 years ago
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org☆37Updated this week
- I leverage this code to get every retweet information of per tweet in SinaWeibo☆2Updated 6 years ago
- Datasette plugin for rendering HTML based on JSON values☆26Updated 3 years ago
- 🗳️ Monitor your country, your city council or your organization promises and objectives☆14Updated 3 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 4 months ago
- Browser version of Hyphe (WIP)☆30Updated 5 months ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- Datasets used for articles and stories made available on Pointer (www.pointer.nl)☆10Updated 5 years ago
- all that favours real-time democracy☆12Updated 2 years ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆80Updated this week
- Collection of awesome SEO-related case studies☆14Updated 4 years ago
- This is a basic instance of the D-Net software toolkit, a software framework for the realization of aggregative data infrastructures.☆15Updated 3 years ago
- Ricgraph - Research in context graph☆27Updated this week