BernhardWenzel / google-taxonomy-matcher
Matches a category of Google's Taxonomy to product that is described in any kind of text data
☆60Updated 6 years ago
Alternatives and similar repositories for google-taxonomy-matcher:
Users that are interested in google-taxonomy-matcher are comparing it to the libraries listed below
- Google Cloud Storage connector, pre-processor and model for predicting user search intent based on keywords☆25Updated 5 years ago
- classify a job description (or noisy job title) into a ONET job title☆18Updated 8 years ago
- Keywords enrichment by autocompletion (AWS, PM, RDC, CDS, ...), google suggestion scraping Heavy multithreaded semantic corpus crawler S…☆12Updated 9 years ago
- API - extract a list of keywords from a text.☆18Updated 7 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated 11 months ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 3 years ago
- Web page segmentation and noise removal☆55Updated 11 months ago
- A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2☆10Updated 4 years ago
- This repository for Web Crawling, Information Extraction, and Knowledge Graph build up.☆33Updated 6 years ago
- Content Extraction using the PageRank algorithm to find the element containing the best content.☆12Updated 5 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings)☆18Updated 10 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Parsing resumes in a PDF format from linkedIn☆68Updated 8 years ago
- A simple algorithm for clustering web pages, suitable for crawlers☆34Updated 7 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated 3 months ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- ☆32Updated 6 years ago
- Node.js application to extract the knowledge represented in Google infoboxes (aka Google Knowlege Graph Panel)☆26Updated 7 years ago
- LinkRun - Data Engineering project done in 3 weeks during the Insight fellowship☆38Updated 4 years ago
- Neural network based lemmatizer for Finnish language☆11Updated 4 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- Cloud crawler functions for scrapeulous☆44Updated 3 years ago
- Prodigy thing(z)☆13Updated 6 years ago
- Text analysis for automatic bookmarking/keyword extraction☆18Updated 8 years ago
- SEMRush SERP Tutorial. Using advertools to Extract and Analyze Search Engine Results Pages Data☆14Updated 6 years ago
- ☆11Updated 4 years ago
- A web application for real-time machine learning and sentiment analysis on Tweets☆43Updated 7 years ago
- Classifies webpages into categories defined in DMOZ dataset☆41Updated 9 years ago
- Source real estate prices from the Common Crawl.☆27Updated 6 years ago