VIDA-NYU / domain_discovery_tool_deprecatedView external linksLinks
Seed acquisition tool to bootstrap focused crawlers
☆23Apr 24, 2017Updated 8 years ago
Alternatives and similar repositories for domain_discovery_tool_deprecated
Users that are interested in domain_discovery_tool_deprecated are comparing it to the libraries listed below
Sorting:
- Tools for scraping of twitter data, conversion, text analysis and graph construction☆11Aug 1, 2016Updated 9 years ago
- ☆44Jan 15, 2016Updated 10 years ago
- This project deals with hierarchical classification of web pages based on dmoz dataset.☆14Apr 10, 2014Updated 11 years ago
- ☆21Jan 23, 2016Updated 10 years ago
- Scraper built with Scrapy.☆18Aug 14, 2024Updated last year
- ☆12Apr 7, 2015Updated 10 years ago
- Code and templates required to build the DARPA open catalog.☆17Mar 23, 2016Updated 9 years ago
- Vizlinc☆15Jan 14, 2016Updated 10 years ago
- Pattern-of-Behavior Search Tool☆11Jun 20, 2022Updated 3 years ago
- Faceted search engine for domain-specific exploration of the Web☆45Feb 10, 2017Updated 9 years ago
- General Architecture for Text Engineering☆49Mar 23, 2016Updated 9 years ago
- MITIE: library and tools for information extraction☆29Jan 22, 2015Updated 11 years ago
- ☆13Nov 30, 2015Updated 10 years ago
- a simple crawler framework☆56May 28, 2015Updated 10 years ago
- The Suspicious Email Submitter is a discontinued browser extension (Chrome, Chromium, Firefox) for the easy submission of suspicious emai…☆15Mar 6, 2023Updated 2 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆74May 23, 2023Updated 2 years ago
- Next generation graph processing platform☆12Aug 26, 2016Updated 9 years ago
- ☆23Mar 7, 2015Updated 10 years ago
- ☆20Mar 31, 2017Updated 8 years ago
- The User Activity Logging Engine, or User-ALE, is a logging mechanism used to quantitatively assess the behavioural and cognitive state o…☆13Aug 26, 2016Updated 9 years ago
- ☆25Jan 26, 2016Updated 10 years ago
- Classifier for predicting user interests based on Twitter profile and using Python library scikit-learn.☆31Jun 7, 2013Updated 12 years ago
- ☆18Jun 8, 2018Updated 7 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15May 2, 2015Updated 10 years ago
- Hybrid Question Answering (HAWK) -- is going to drive forth the OKBQA vision of hybrid question answering system using Linked Data and fu…☆16Oct 4, 2022Updated 3 years ago
- For interacting with nutch via Python☆29Updated this week
- Viewers for statistics and dashboarding of Domain Search Engine data☆126Jan 19, 2016Updated 10 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- LINKED DATA QUALITY REPORTS☆41May 20, 2022Updated 3 years ago
- A contextual news development environment.☆49Dec 19, 2014Updated 11 years ago
- Python module for the parallel dbscan based on NWU code☆18Aug 14, 2024Updated last year
- Formasaurus tells you the type of an HTML form and its fields using machine learning☆119Updated this week
- ☆20Nov 1, 2017Updated 8 years ago
- Pikes is a Knowledge Extraction Suite☆23Nov 14, 2023Updated 2 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Feb 8, 2026Updated last week
- A Topic Modeling toolbox☆92Apr 26, 2016Updated 9 years ago
- Library for Geo-Inferencing in Twitter Data☆28Jun 10, 2016Updated 9 years ago
- A test tool to evaluate the conformity of medical devices with the ISO/IEEE 11073 SDC standard family.☆10Jun 18, 2025Updated 7 months ago
- An Exploration into Graph Databases☆28Oct 7, 2015Updated 10 years ago