CitizensFoundation / pace-keyword-scanner
CommonCrawl keyword scanner. Scanning one month of CommonCrawl data for hundreds of keywords takes about 3 hours on an EC2 c5.18xlarge instance. Second-level filtering is LLM (BERT) based. Developed with support from the EU and the Populism & Civic Engagement H2020 project.
☆17Updated 2 years ago
Alternatives and similar repositories for pace-keyword-scanner
Users who are interested in pace-keyword-scanner are comparing it to the libraries listed below
- Track changes to GraphQL APIs by git scraping their schemas☆30Updated 7 months ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆115Updated this week
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Updated 5 years ago
- Real-Time Proxy & Web Scraping API☆24Updated 6 years ago
- An open-source archive that gathers, saves, shares and analyzes news homepages☆148Updated last week
- A list of awesome browser extensions to help with SEO and rank higher!☆24Updated 5 years ago
- Containerized workflow automation tool☆21Updated this week
- Datasette enrichment for analyzing row data using OpenAI's GPT models☆21Updated last year
- Penme is a lightweight open source note taking app focused on privacy!☆26Updated 5 years ago
- Scrape various open data directories to create an index of what's available out there☆37Updated 9 months ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- Datasette plugin for uploading CSV files and converting them to database tables☆27Updated 3 weeks ago
- A helper library full of URL-related heuristics.☆72Updated 2 months ago
- Local SMTP desktop app for debugging and previewing your emails☆16Updated 9 months ago
- A case management app built with Lowdefy.☆32Updated last year
- A minimal client-side library to convert your vanilla URLs to deep links.☆19Updated 4 years ago
- Scrape HN to track links from specific domains☆68Updated this week
- ☆26Updated 4 years ago
- VFRAME: Visual Forensics and Metadata Extraction☆74Updated 2 years ago
- keywords-extract - Command line tool to extract keywords from any web page.☆61Updated 7 years ago
- ☆25Updated 2 years ago
- A Command line interface that allows you to manage the back end of your self hosted typesense server. Builds on top of the typesense js l…☆16Updated 2 years ago
- Browser version of Hyphe (WIP)☆31Updated 6 months ago
- CLI utility to scrape emails from websites☆170Updated last week
- List of free and checked http, https, socks4 and socks5 proxies☆16Updated 2 weeks ago
- Architectural design records, technical notes, and issues for the digital land project☆29Updated 5 months ago
- Easily build and maintain any kind of contract. Free and Open Source☆98Updated 8 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated last year
- Datasette plugin for rendering HTML based on JSON values☆28Updated 3 years ago
- Interface for Google Trends time series☆12Updated 3 years ago