hlavki / jlemmagen
Java implmentation of LemmaGen project
☆10Updated 3 years ago
Alternatives and similar repositories for jlemmagen:
Users that are interested in jlemmagen are comparing it to the libraries listed below
- This plugin provides a useful feature for multi-language☆14Updated 2 years ago
- NERC-fr: Supervised Named Entity Recognition for French☆14Updated 9 years ago
- small Java library for splitting German compound words☆63Updated 11 months ago
- SOLR bulk indexing utility for the command line.☆45Updated 3 weeks ago
- Detect the language of text☆36Updated 4 years ago
- Text similarity based on Word2Vec vectors.☆11Updated 8 years ago
- Solr Query Segmenter for structuring unstructured queries☆21Updated 3 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- XQuery wrapper around the Stanford CoreNLP pipeline☆12Updated last year
- Apache OpenNLP Sandbox☆42Updated this week
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆24Updated 3 months ago
- An implementation of the Watset clustering algorithm in Java.☆29Updated 2 years ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache …☆26Updated 2 years ago
- A Java UIMA-based toolbox for multilingual and efficient terminology extraction an multilingual term alignment☆40Updated 7 years ago
- Small scripts for processing Solr files☆10Updated last year
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 4 years ago
- Visualization of result returning by Solr 6 graph query☆10Updated 8 years ago
- Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction from unstructured data and GIS outputters.☆44Updated 2 months ago
- The OpenSextant Gazetteer is a collection of world-wide place name data☆12Updated 7 years ago
- ☆18Updated 9 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆20Updated 3 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆26Updated last month
- Deprecated Module: See Xponents or OpenSextantToolbox as active code base.☆31Updated 11 years ago
- A system for unsupervised knowledge-free interpretable word sense disambiguation based on distributional semantics☆19Updated 7 years ago
- Raw Wikipedia counts for entity linking☆19Updated 7 years ago
- SKOS Support for Apache Lucene and Solr☆56Updated 3 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- Multilingual automatic text summarizer using statistical approach and extraction☆34Updated 5 years ago
- Advanced desktop search/corpus exploration prototype☆21Updated 3 years ago