Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends
☆57 · Jan 28, 2024 · Updated 2 years ago
Alternatives and similar repositories for KeywordAnalysis
Users who are interested in KeywordAnalysis are comparing it to the libraries listed below.
- API - extract a list of keywords from a text. ☆18 · Jul 6, 2017 · Updated 8 years ago
- LinkRun - Data Engineering project done in 3 weeks during the Insight fellowship ☆38 · Apr 2, 2020 · Updated 6 years ago
- An attempt to use financial news to predict stock market ☆16 · Nov 17, 2018 · Updated 7 years ago
- Extraction code used to create the Dresden Web Table Corpus ☆14 · Feb 25, 2015 · Updated 11 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆47 · Jan 16, 2022 · Updated 4 years ago
- Single-threaded epoll-based concurrent bulk whois client ☆31 · Oct 31, 2017 · Updated 8 years ago
- ☆15 · Aug 15, 2012 · Updated 13 years ago
- ☆19 · Dec 19, 2018 · Updated 7 years ago
- Detecting Trends in Job Advertisements ☆20 · Aug 13, 2018 · Updated 7 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools) ☆31 · Oct 3, 2023 · Updated 2 years ago
- CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop ☆38 · Mar 12, 2026 · Updated 2 months ago
- It is a Chrome extension, an alternative to ChatGPT. It is free and no data leaves your computer. Powered by WebLLM. ☆16 · Mar 4, 2024 · Updated 2 years ago
- Use Python to Automate the PowerPoint Update ☆15 · May 28, 2023 · Updated 2 years ago
- text analysis with ngrams for nodejs ☆23 · Dec 6, 2011 · Updated 14 years ago
- A helpful package that helps you access shell & shell-based applications via web application ☆16 · Jul 25, 2023 · Updated 2 years ago
- European Parliament website Python scraper ☆12 · Oct 19, 2016 · Updated 9 years ago
- Convert powerpoint (pptx) files into raw text org or LaTeX files ☆15 · Aug 28, 2018 · Updated 7 years ago
- Code for the paper "Benchmarking sentiment analysis methods for large-scale texts: A case for using continuum-scored words and word shift… ☆16 · Jun 8, 2017 · Updated 8 years ago
- Extract images from PowerPoint files ☆17 · Dec 1, 2011 · Updated 14 years ago
- Scripts for building a geo-located web corpus using Common Crawl data ☆11 · Jan 18, 2026 · Updated 4 months ago
- Automated generation of powerpoint slides for fun and profit ☆13 · Oct 18, 2017 · Updated 8 years ago
- Break words and phrases into ngrams. ☆12 · Dec 12, 2013 · Updated 12 years ago
- A simple Python wrapper for Yandex's Tomita Parser (no longer needed, since Yandex has released the source code) ☆17 · Nov 26, 2015 · Updated 10 years ago
- Content Extraction using the PageRank algorithm to find the element containing the best content. ☆13 · Aug 14, 2019 · Updated 6 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts ☆59 · Sep 5, 2012 · Updated 13 years ago
- Generate epicycles to Fourier Transform 2D images ☆13 · Jun 10, 2025 · Updated 11 months ago
- A (massive) DNS tools (reverse lookup for now) ☆12 · Jul 6, 2022 · Updated 3 years ago
- create concept map from textbook data ☆11 · May 4, 2018 · Updated 8 years ago
- Miqra According to the Masorah in two JSON formats ☆12 · Updated this week
- Code for reconstructing full-text news articles from the GDELT Web News NGrams 3.0 dataset ☆28 · Apr 28, 2026 · Updated 3 weeks ago
- This is the facade for installation and access to the individual components ☆16 · Apr 8, 2026 · Updated last month
- A Supervised Approach To The Interpretation Of Imperative To-Do Lists ☆12 · Jun 29, 2018 · Updated 7 years ago
- Statistics of Common Crawl monthly archives mined from URL index files ☆221 · Updated this week
- Dockerized Rails and React Example ☆11 · Feb 1, 2024 · Updated 2 years ago
- ☆13 · Jun 14, 2016 · Updated 9 years ago
- Save yourself from 'Death by PowerPoint' ☆15 · Feb 18, 2020 · Updated 6 years ago
- Deploy ProcessGroup into your LIVE NiFi data-flow. ☆13 · Oct 14, 2016 · Updated 9 years ago
- ☆15 · Dec 2, 2019 · Updated 6 years ago
- [DEPRECATED] Baseline Project for Semantic Searching ☆10 · Oct 15, 2018 · Updated 7 years ago