A Corpus Data Retrieval Index using Lucene for Look-Ups
☆19Feb 24, 2026Updated last week
Alternatives and similar repositories for Krill
Users that are interested in Krill are comparing it to the libraries listed below
Sorting:
- Translation of query languages to serialized KoralQuery protocol☆13Feb 23, 2026Updated last week
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Multi Tier Annotation Search☆12May 13, 2024Updated last year
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Dec 2, 2024Updated last year
- A tiny graph database engine written in C☆10May 9, 2014Updated 11 years ago
- ☆11Feb 13, 2026Updated 2 weeks ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆122Updated this week
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆18Jan 13, 2026Updated last month
- A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency co…☆20May 11, 2015Updated 10 years ago
- Coquery is a free corpus query tool for linguists, lexicographers, translators, and anybody who wishes to search and analyse a text corpu…☆19Jan 7, 2026Updated last month
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆23Apr 23, 2024Updated last year
- Korpuslinguistik war noch nie so einfach...☆25Feb 18, 2026Updated last week
- Python wrapper for the CWB to extract concordances and score frequency lists☆22Jan 12, 2026Updated last month
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆24Jan 3, 2025Updated last year
- OStatus Specification☆20Apr 11, 2015Updated 10 years ago
- Multi Tier Annotation Search☆26May 12, 2021Updated 4 years ago
- Morphological analyzer and lemmatizer for Latin.☆29Dec 10, 2025Updated 2 months ago
- Platform-independent versions of Pleiades gazetteer data☆39Updated this week
- The Zurich Dependency Parser for German☆89Aug 27, 2025Updated 6 months ago
- What happens when you connect all the ZIP/postal codes in a country in ascending order?☆13Sep 25, 2024Updated last year
- Gazetteer of the Ancient Near East Data☆10Aug 1, 2013Updated 12 years ago
- A PHP library for comparing two or more Sanskrit TEI XML files and generating an apparatus with variants☆14Feb 16, 2026Updated 2 weeks ago
- A framework, data and configs for generating and building Tesseract OCR lang.traineddata model files, specifically for Japanese☆10Dec 9, 2013Updated 12 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Oct 14, 2022Updated 3 years ago
- Scraper for German democracy documents☆44Sep 12, 2023Updated 2 years ago
- Lightweight, multilingual natural language processing☆63Apr 8, 2013Updated 12 years ago
- Redis tcp map for postfix☆12Jun 28, 2024Updated last year
- A lexical normalizer for historical spelling variants using a transformer architecture.☆10Mar 12, 2025Updated 11 months ago
- Grecka is a python script to convert Greek to Greeklish based on ELOT 743☆12Aug 4, 2018Updated 7 years ago
- "Save as DAISY" add-in for Microsoft Word☆10Dec 22, 2025Updated 2 months ago
- Focused Crawler for VT's CTRNet☆10May 13, 2013Updated 12 years ago
- TEI-encoded contents of the Egyptian Gazette☆15Jun 11, 2024Updated last year
- Madek main web interface☆21Updated this week
- Oracc GUI☆12Jun 27, 2025Updated 8 months ago
- A Simple Sudoku Solver☆23Nov 26, 2012Updated 13 years ago
- Speech ANDroid Apps☆20Jan 22, 2014Updated 12 years ago
- Project to digitize avant-garde periodicals☆12May 13, 2022Updated 3 years ago
- An expandable and scalable OCR pipeline☆89Nov 14, 2017Updated 8 years ago
- Tool for interacting with the reMarkable lines format and API☆13Jul 1, 2021Updated 4 years ago