korpling / pepperLinks
A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used stand-alone as a command line interface, or be integrated as an API into other software products.
☆24Updated 8 months ago
Alternatives and similar repositories for pepper
Users that are interested in pepper are comparing it to the libraries listed below
Sorting:
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆75Updated 3 months ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- Named entity annotation tool☆28Updated 2 years ago
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- FairCopy is a word processor for the humanities scholar.☆10Updated this week
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 4 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Updated 3 years ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Updated 3 years ago
- Multi Tier Annotation Search☆12Updated last year
- A repository for illustrating the transformation of a PAGE XML file into XML-TEI format, resulting from experimentations made for the LEC…☆16Updated 3 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆18Updated 3 weeks ago
- ☆13Updated this week
- ☆11Updated 5 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 6 years ago
- Named Entity Recognition☆18Updated 5 months ago
- Data space of the DARIAH Lexical Resources Working Group☆21Updated 3 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 3 years ago
- DTA Base Format (DTABf)☆18Updated 6 months ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)☆16Updated 3 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 7 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated this week
- A Python database interface for eXist-db☆14Updated 2 weeks ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Updated last year
- A simple configurable tool for manipulating dependency trees.☆14Updated 8 months ago
- High-performance text aligner for large collections of texts☆52Updated this week