korpling / pepper
A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used stand-alone as a command line interface, or be integrated as an API into other software products.
☆23Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pepper
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆69Updated this week
- Multi Tier Annotation Search☆26Updated 3 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated last month
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Updated last year
- Named entity annotation tool☆27Updated last year
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Text-Induced Corpus Clean-up☆20Updated last year
- LexInfo - Data Category Ontology for OntoLex-Lemon☆22Updated last year
- Software for multi-level annotation of linguistic corpora☆17Updated 4 years ago
- Specification of a stand-off element for the TEI guidelines☆12Updated 3 years ago
- Digitale Geisteswissenschaften rund um Graphentechnologien☆8Updated 6 months ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 2 years ago
- A tool for automatic spelling normalization☆20Updated 3 years ago
- Ontolex modules☆30Updated last year
- Named Entity Recognition☆16Updated this week
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated last year
- WordNet-LMF formats☆20Updated this week
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆60Updated 5 months ago
- Multi Tier Annotation Search☆12Updated 5 months ago
- Schema for modelling parliamentary debates☆21Updated 2 years ago
- Advanced graph rewriting and LLOD publication for CoNLL and other TSV formats☆25Updated 5 months ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Updated 2 years ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 6 years ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 4 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- A simple configurable tool for manipulating dependency trees.☆13Updated 6 months ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated last year
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- ☆11Updated 4 years ago
- Towards a consolidated LOD vocabulary for linguistic annotations☆15Updated 2 years ago