CitizensFoundation / pace-keyword-scanner
CommonCrawl keyword scanner. Scanning one month of CommonCrawl (CC) data for hundreds of keywords takes about 3 hours on an EC2 c5.18xlarge instance. Uses LLM (BERT) based second-level filtering. Developed with support from the EU and the Populism & Civic Engagement H2020 project.
☆15 · Updated 2 years ago
Alternatives and similar repositories for pace-keyword-scanner
Users who are interested in pace-keyword-scanner are comparing it to the repositories listed below
- Matrix-based News Aggregation to Explore Media Bias ☆20 · Updated 7 years ago
- Collection of awesome SEO-related case studies ☆15 · Updated 4 years ago
- Track changes to GraphQL APIs by git scraping their schemas ☆28 · Updated 2 months ago
- Helps you to visualize the site structure ☆9 · Updated 2 years ago
- Vector Embedding Markup Language - markup language designed specifically for annotating and structuring data related to vector embeddings… ☆12 · Updated last year
- Introduction to OpenDolphin - A centralized, open source, unbiased and secure social network built by the community, for the community. ☆10 · Updated 2 years ago
- A Google Trends Analytics Package ☆13 · Updated last year
- ☆24 · Updated last year
- A Command line interface that allows you to manage the back end of your self hosted typesense server. Builds on top of the typesense js l… ☆16 · Updated last year
- A Fediverse robot account that posts the latest public records requests filed and completed at muckrock.com ☆14 · Updated last week
- Blacklight is a powerful secret, keys and sensitive data scanning tool that helps you detect and prevent sensitive information leaks in y… ☆13 · Updated last month
- List of privacy-friendly analytics solutions ☆20 · Updated 2 years ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community. ☆9 · Updated 3 years ago
- The Misinformation Game is a social-media simulator built to study how people interact with information on social-media. ☆29 · Updated 3 months ago
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation… ☆13 · Updated 2 weeks ago
- This script fetches search queries and excludes those that have a negative sentiment. ☆10 · Updated 6 years ago
- Email Enricher is a free, offline alternative to Clearbit for enriching emails. Determine if an email likely belongs to a Fortune 1000 co… ☆17 · Updated last year
- Real-Time Proxy & Web Scraping API ☆24 · Updated 5 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.… ☆24 · Updated 4 years ago
- ☆12 · Updated last year
- ☆12 · Updated 3 months ago
- Matomo plugin for Docusaurus v2/v3 ☆14 · Updated last year
- how hard is it to get a list of all local news sites in the United States (LOL) ☆8 · Updated 5 years ago
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production … ☆14 · Updated last year
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data ☆23 · Updated last year
- 😎 A community-curated list of awesome lawtech software and learning resources for legal technology and design. ☆26 · Updated 5 years ago
- A package to take the pain out when working with the Google Search Console Search Analytics Query API. ☆11 · Updated last year
- GNewsScraper is a TypeScript package that scrapes article data from Google News based on a keyword or phrase. It returns the results as a… ☆12 · Updated last year
- A list of awesome browser extensions to help with SEO and rank higher! ☆24 · Updated 4 years ago
- all that favours real-time democracy ☆14 · Updated 2 years ago