CI-Research / KeywordAnalysis
Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends
☆56 Updated last year
Alternatives and similar repositories for KeywordAnalysis:
Users who are interested in KeywordAnalysis are comparing it to the repositories listed below.
- API - extract a list of keywords from a text. ☆18 Updated 7 years ago
- Site Hound (previously THH) is a Domain Discovery Tool ☆23 Updated 3 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆14 Updated 7 years ago
- Exploring Common-Crawl using Python and DynamoDB ☆33 Updated 7 years ago
- Parsing resumes in PDF format from LinkedIn ☆68 Updated 8 years ago
- Source real estate prices from the Common Crawl. ☆27 Updated 6 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 Updated 5 years ago
- Scrapes sites. Gets news. Eventually events. ☆85 Updated 9 years ago
- keywords-extract - Command line tool to extract keywords from any web page. ☆63 Updated 6 years ago
- Traptor -- A distributed Twitter feed ☆26 Updated 2 years ago
- Watchman: An open-source social-media event-detection system ☆21 Updated 6 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆97 Updated 2 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings) ☆17 Updated 10 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆149 Updated 2 months ago
- Extract social media links and account names from websites. ☆38 Updated 4 years ago
- A browser extension that lets you find email addresses for any domain with a single click. ☆71 Updated 7 years ago
- Using Scrapy to get company profiles from http://crunchbase.com ☆31 Updated 11 years ago
- Text analysis for automatic bookmarking/keyword extraction ☆18 Updated 8 years ago
- Web Page Inspection Tool UI. Google SERP Preview, Sentiment Analysis, Keyword Extraction, Named Entity Recognition & Spell Check ☆24 Updated 2 years ago
- This is the facade for installation and access to the individual components ☆15 Updated 6 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders… ☆45 Updated 3 years ago
- Chrome extension that will scrape a LinkedIn profile. ☆32 Updated 2 years ago
- Aviation grade news article metadata extraction ☆37 Updated 2 years ago
- Console program to get global ranking for a given website or domain ☆21 Updated 2 years ago
- Scrapy pipeline which allows you to store Scrapy items in a Solr server. ☆19 Updated 8 years ago
- A visualisation tool for spaCy using Hierplane. ☆65 Updated 2 years ago
- Simple RSS feed reader for Hacker News. ☆28 Updated 2 years ago
- Suite of tools for detecting changes in web pages and their rendering ☆54 Updated last year
- Pipeline for distributed Natural Language Processing, made in Python ☆64 Updated 8 years ago
- Scraping Assisted by Learning ☆35 Updated this week