korpling / pepper
A highly extensible platform for conversion and manipulation of linguistic data between an unbounded set of formats. Pepper can be used stand-alone as a command line interface, or be integrated as an API into other software products.
☆24Updated 6 months ago
Alternatives and similar repositories for pepper
Users who are interested in pepper are comparing it to the libraries listed below.
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆75Updated last month
- Multi Tier Annotation Search☆26Updated 4 years ago
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated last month
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- A tool for automatic spelling normalization☆20Updated 4 years ago
- Named entity annotation tool☆28Updated 2 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Updated 2 years ago
- A simple configurable tool for manipulating dependency trees.☆13Updated 6 months ago
- All ontologies used in NIF 2.0 (NIF-Core + vocabulary modules + helper modules)☆37Updated 8 years ago
- Digital humanities centered on graph technologies☆8Updated 4 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 2 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Software for multi-level annotation of linguistic corpora☆17Updated 5 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Updated 3 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated this week
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- A web-based, token-level annotation tool for non-standard language data☆10Updated 4 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆69Updated 3 weeks ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated last year
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 5 months ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆112Updated this week
- The Open Multilingual Wordnet☆61Updated last year
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 6 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated last year
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆56Updated last week
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated last year
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA☆23Updated 3 years ago
- Parser for KAF NAF files written in Python☆16Updated 4 years ago