VIDA-NYU / domain_discovery_toolLinks
This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.
☆45Updated 3 years ago
Alternatives and similar repositories for domain_discovery_tool
Users that are interested in domain_discovery_tool are comparing it to the libraries listed below
Sorting:
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visuali…☆83Updated 5 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆56Updated 3 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Updated 2 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆56Updated last year
- General Architecture for Text Engineering☆50Updated 9 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Updated 9 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Simple taxonomy management tool and document classifier.☆56Updated 5 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆268Updated 2 years ago
- Cuttlefish aims to be a highly extensible visualization and analysis platform for all kinds of network data☆18Updated 7 years ago
- A list of memex-related tools and their repository URLs☆151Updated 7 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- This repository for Web Crawling, Information Extraction, and Knowledge Graph build up.☆33Updated 7 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 3 years ago
- Browser add-on and web server to support collection and analysis of web browsing data.☆13Updated 9 years ago
- Deployment of pywb as a CommonCrawl Index Server☆21Updated 7 years ago
- Browser version of Hyphe (WIP)☆31Updated 2 months ago
- Universal backend for indexing, storing, and querying documents.☆25Updated 5 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated 2 years ago
- Installer for Thymeflow, a personal knowledge management system.☆33Updated 7 years ago
- Record Linkage ToolKit (Find and link entities)☆110Updated last year
- Extraction Toolkit☆83Updated 3 years ago
- 📦 The Knowledge Box - A data dependency management framework to help users to publish, find and install data models☆46Updated this week
- Code and templates required to build the DARPA open catalog.☆17Updated 9 years ago
- TypeDB Driver Example Projects and Tutorials☆86Updated last year
- An open source search engine written in C/C++ for Linux on Intel/AMD. From gigablast dot com. See the README.md file below for instructio…☆26Updated 7 years ago
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents☆29Updated 6 years ago