Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends
☆57Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for KeywordAnalysis
Users that are interested in KeywordAnalysis are comparing it to the libraries listed below
Sorting:
- API - extract a list of keywords from a text.☆18Jul 6, 2017Updated 8 years ago
- LinkRun - Data Engineering project done in 3 weeks during the Insight fellowship☆39Apr 2, 2020Updated 5 years ago
- ☆16Aug 15, 2012Updated 13 years ago
- Single-threaded epoll-based concurrent bulk whois client☆31Oct 31, 2017Updated 8 years ago
- Extraction code used to create the Dresden Web Table Corpus☆14Feb 25, 2015Updated 11 years ago
- Detecting Trends in Job Advertisements☆20Aug 13, 2018Updated 7 years ago
- Community driven landing page generator for open source projects☆15Jan 25, 2016Updated 10 years ago
- Python script to split PDF files into separate files based on bookmarks☆16Jan 21, 2022Updated 4 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Jan 16, 2022Updated 4 years ago
- Problem Statement: Given a particular PDF/Text document ,How to extract keywords and arrange in order of their weightage using Python?☆21Jan 17, 2022Updated 4 years ago
- Corpus of domain names scraped from Common Crawl and manually annotated to add word boundaries (e.g. "commoncrawl" to "common crawl").☆20Jun 16, 2025Updated 8 months ago
- Wikipedia-based keyword extraction tool in Java☆21May 11, 2015Updated 10 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆28Apr 5, 2017Updated 8 years ago
- This Python code scrapes Google search results then applies sentiment analysis, generates text summaries, and ranks keywords.☆29Feb 14, 2021Updated 5 years ago
- English name parser☆33Oct 12, 2024Updated last year
- Awesome list of the software tools related to opendata: data catalogs, ingestion tools, data prep tools and so on☆35Oct 28, 2025Updated 4 months ago
- Node project to collect Posts, Like, Comments, Follows and Following stats from Instagram profiles without signing for their API☆12Mar 25, 2024Updated last year
- jQuery based exit popup model -☆12Jan 30, 2017Updated 9 years ago
- ☆10Nov 20, 2024Updated last year
- ☆17Jun 7, 2023Updated 2 years ago
- European Parliament website Python scraper☆12Oct 19, 2016Updated 9 years ago
- Repository for the mijn.amsterdam.nl portal☆11Updated this week
- Tooling used for Binance DEX simulation trading competition on Binance testnet☆13Mar 22, 2019Updated 6 years ago
- Application of Blockchain in Crop Farming and Crop Supply☆10May 15, 2018Updated 7 years ago
- LD-Explorer is the missing tool for exploring, federating and querying linked data resources directly from the browser☆19Updated this week
- Experimental AGS data fotmat tool in python☆12Oct 17, 2018Updated 7 years ago
- Process Common Crawl data with Python and Spark☆452Jan 20, 2026Updated last month
- Analytics tool that applies Natural Language Processing (NLP) and Machine Learning (ML), such as concept extraction, idea classification,…☆10Dec 7, 2022Updated 3 years ago
- Bootyman deploys and manages large-scale Laravel SaaS applications in self-contained VMs in cloud☆11Jan 3, 2023Updated 3 years ago
- Experimental implementation of regions in WebVTT building on Anne's WebVTT parser.☆14Oct 19, 2014Updated 11 years ago
- Tool to identify domains containing Pinyin language☆12Oct 18, 2014Updated 11 years ago
- Mirror of Apache Apex site☆10Apr 29, 2025Updated 10 months ago
- Building Business Solutions with PowerApps and the Power Platform☆11Jan 30, 2020Updated 6 years ago
- An experimental distributed map reduce system based on Google's MapReduce, written in Rust!☆10Aug 3, 2022Updated 3 years ago
- CBE 30338 Chemical Process Control☆14Feb 27, 2024Updated 2 years ago
- Scholarly Big Data Subject Category Classifier☆10Jul 15, 2019Updated 6 years ago
- A scheduler to manage a multi tool dual arm robot while avoiding arm-to-arm collisions; considering complex side constraints; and optimiz…☆11Jul 6, 2021Updated 4 years ago
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- A tutoring app solves a real problem for students — to find an affordable and knowledgeable tutor on-demand. The design is based on a tw…☆10Nov 18, 2017Updated 8 years ago