KorAP / Koral
Translation of query languages to serialized KoralQuery protocol
☆11 · Updated this week
Alternatives and similar repositories for Koral:
Users who are interested in Koral compare it with the libraries listed below.
- A Corpus Data Retrieval Index using Lucene for Look-Ups ☆17 · Updated this week
- Multi Tier Annotation Search ☆12 · Updated 9 months ago
- ☆28 · Updated this week
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with… ☆74 · Updated 2 weeks ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆12 · Updated last year
- Multi Tier Annotation Search ☆26 · Updated 3 years ago
- Semanticizest: dump parser and client ☆20 · Updated 8 years ago
- A highly extensible platform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st… ☆24 · Updated last month
- A web-based, token-level annotation tool for non-standard language data ☆10 · Updated 4 years ago
- Thot toolkit for statistical machine translation ☆50 · Updated 2 years ago
- A tool for automatic spelling normalization ☆20 · Updated 4 years ago
- A simple configurable tool for manipulating dependency trees. ☆13 · Updated last month
- Software for multi-level annotation of linguistic corpora ☆17 · Updated 5 years ago
- Fast and robust NLP components implemented in Java. ☆52 · Updated 4 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet… ☆29 · Updated 2 months ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆111 · Updated 3 weeks ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate… ☆51 · Updated 4 years ago
- SKOS Support for Apache Lucene and Solr ☆56 · Updated 3 years ago
- SKOS analysis for Elasticsearch ☆54 · Updated 8 years ago
- NYT Risk Semantics Project ☆12 · Updated 8 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆67 · Updated 2 weeks ago
- Virtual Language Observatory ☆15 · Updated 5 months ago
- A Utility Library for Wikipedia dumps ☆33 · Updated 7 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆61 · Updated 9 months ago
- NLP tools developed by Emory University. ☆60 · Updated 8 years ago
- ☆22 · Updated 11 months ago
- Spectral Word Embedding Learning for Language (SWELL) toolkit ☆27 · Updated 10 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling… ☆15 · Updated last year
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆48 · Updated 2 years ago
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab. ☆34 · Updated 2 years ago