PreferredAI / venom
Your preferred open source focused crawler for the deep web.
☆73 Updated last year
Alternatives and similar repositories for venom:
Users who are interested in venom are comparing it to the libraries listed below.
- Your personalized retrieval engine ☆28 Updated 3 years ago
- A basic web crawler example ☆9 Updated 4 years ago
- A Topic Model for Document Comparison ☆13 Updated 5 years ago
- A tutorial on scalable retrieval of matrix factorization recommendations ☆26 Updated 5 years ago
- Tools and other things for people who work on search relevance & information retrieval ☆84 Updated last year
- This repository contains the core model we called "Collaborative filtering enhanced Content-based Filtering" published in our UMUAI artic… ☆12 Updated 5 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders… ☆46 Updated 3 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 Updated 6 years ago
- Suite of tools for detecting changes in web pages and their rendering ☆54 Updated last year
- Site Hound (previously THH) is a Domain Discovery Tool ☆23 Updated 3 years ago
- Search relevance evaluation toolkit ☆31 Updated 2 years ago
- This repository for Web Crawling, Information Extraction, and Knowledge Graph build up. ☆33 Updated 6 years ago
- Optimal distributed data deduplication and supervised learning pipeline using Apache Spark ☆10 Updated 4 years ago
- Common Crawl fork of Apache Nutch ☆29 Updated last week
- Web page segmentation and noise removal ☆55 Updated 11 months ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents ☆34 Updated 4 years ago
- Zulia Search Engine ☆32 Updated last week
- Cross online social network crawler to link users from Twitter, Instagram and Foursquare ☆22 Updated 8 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear… ☆85 Updated 3 years ago
- learning related projects ☆18 Updated 9 years ago
- Deviant Spy is a native advertising (RevContent) spy tool ☆30 Updated 6 years ago
- PDF analysis. Convert contents of PDF to a JSON-style python dictionary. ☆31 Updated 2 years ago
- Collect and filter location information from social network services. ☆9 Updated 4 years ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm ☆66 Updated 4 years ago
- Samantha - A generic recommender and predictor server ☆76 Updated last year
- This is a new deep learning model for recommender system, which we called PHD ☆33 Updated 6 years ago
- Java-Based Context-aware Recommendation Library ☆125 Updated 2 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services ☆25 Updated 6 months ago
- The purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacio… ☆62 Updated 6 years ago
- The LAW next generation crawler. ☆87 Updated 3 years ago