SeldonIO / semantic-vectors-lucene-toolsLinks
Tools for building a Lucene index for Semantic Vectors
☆21Updated 9 years ago
Alternatives and similar repositories for semantic-vectors-lucene-tools
Users that are interested in semantic-vectors-lucene-tools are comparing it to the libraries listed below
Sorting:
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 2 years ago
- Seldon Spark Jobs☆26Updated 10 years ago
- ☆22Updated last year
- NLP toolkit (tokenizer, POS-tagger, parser, etc.)☆43Updated 8 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆65Updated 8 years ago
- A subgroup discovery tool that can use ontological domain knowledge (RDF graphs) in the learning process. Subgroup descriptions contain t…☆12Updated 7 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- ☆20Updated 8 years ago
- Ranking Entity Types using the Web of Data☆30Updated 8 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Updated 8 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated 2 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Design algorithms for cross document coreference resolution☆17Updated 11 years ago
- Mention-anomaly-based event detection and tracking in Twitter☆17Updated 8 years ago
- A tool for calculation semantic similarity between words from a text corpus based on lexico-syntactic patterns.☆27Updated 9 years ago
- Vector search in Lucene based search attempting to use just the existing Lucene data structures (experimental)☆43Updated 5 years ago
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab.☆34Updated 2 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆12Updated 5 months ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation.☆11Updated 6 years ago
- MetroMaps Release☆16Updated 11 years ago
- Exploration Library in Java☆12Updated 2 years ago
- CrowdRec reference framework☆32Updated 8 years ago
- Python functions for popular relevance metrics (ndcg, err, etc)☆16Updated last year
- KnowledgeStore☆20Updated 7 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 3 years ago