sonarme / lukeLinks
DEPRECATED, since we cannot maintain this Luke repo any longer. Please fork / Luke fork for Lucene 4.3 (mavenized)
☆15Updated 4 years ago
Alternatives and similar repositories for luke
Users that are interested in luke are comparing it to the libraries listed below
Sorting:
- Facilitates the indexing of content from a CSV into ElasticSearch☆26Updated 12 years ago
- ☆24Updated 12 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- TAUS Dynamic Quality Framework API☆12Updated 5 years ago
- Grapheme to phoneme toolkit using joint-modelling + CRFs in java☆14Updated 7 years ago
- Morpha lex stemmer converted using jflex.☆24Updated 5 years ago
- Shell scripts to assist downloading & processing the Google n-grams corpora☆14Updated 8 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Updated 2 years ago
- (Labeled) Latent Dirichlet Allocation on a sentence level with Gibbs Sampling☆10Updated 11 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Updated 2 years ago
- Analysis plugin for ElasticSearch providing capability for processing inline annotations in documents.☆35Updated 11 years ago
- Language checker and hyphenator extension for LibreOffice☆12Updated 5 years ago
- Restful pipeline command support plugin for Elasticsearch☆33Updated 9 years ago
- This is an experimental project to enhance Optical Character Recognition technique to recognize text from natural images☆14Updated 11 years ago
- Term List Matching Plugin for ElasticSearch☆26Updated 11 years ago
- Latent Dirichlet Allocation with Gibbs sampling☆16Updated 11 years ago
- This repository contains the complete source code that we used to conduct experiments in the paper: Text Window Denoising Autoencoder: Bu…☆15Updated 12 years ago
- Morfessor FlatCat☆13Updated 6 years ago
- LPC, vowels, formants. A repo to save my research on this topic☆21Updated 7 years ago
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn☆58Updated 12 years ago
- Aelius is a suite of Python, NLTK-based modules and language data for training and evaluating POS-taggers for Brazilian Portuguese and an…☆19Updated 13 years ago
- tgrep2 Searching for NLTK Trees☆15Updated 9 years ago
- Parser for KAF NAF files written in Python☆16Updated 4 years ago
- A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency co…☆20Updated 10 years ago
- Web frontend for Myria☆11Updated 5 years ago
- Base components for Question Answering pipelines☆28Updated 3 years ago
- ☆16Updated 13 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆13Updated 10 years ago
- Text Detection and Recognition in Video☆11Updated 11 years ago
- ☆12Updated 4 months ago