santhoshtr / sfstLinks
Stuttgart Finite State Transducer system
☆23Updated 3 months ago
Alternatives and similar repositories for sfst
Users that are interested in sfst are comparing it to the libraries listed below
Sorting:
- Crop And Splice Segments (of scanned pages)☆14Updated 6 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated last year
- Targetted language identifier, based on FastText and Hunspell.☆37Updated 2 months ago
- ☆17Updated 4 months ago
- Lexical data at Unicode☆70Updated last year
- Link Wikidata items to large catalogs☆96Updated last month
- Wikidata authority file mapping tool☆11Updated 7 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆18Updated last month
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 6 years ago
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- Clone of https://gitlab.com/scripta/escriptorium.git☆30Updated 3 weeks ago
- Named entity annotation tool☆28Updated 2 years ago
- tesseractXplore a tesseract ease of use gui with full control☆24Updated 4 years ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated 2 years ago
- Python tools for interacting with Wikidata☆156Updated 2 years ago
- Shobhika is a Devanāgarī font for scholars.☆48Updated 6 years ago
- View HOCR files with Mirador☆29Updated 8 years ago
- Tools for working with book data☆18Updated last month
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 3 years ago
- Glyph Miner, a system for extracting glyphs from early typeset prints☆34Updated 9 years ago
- Perpetual Access To The Scholarly Record☆120Updated last year
- Demonstration of searching PDF document with Solr, Tika, and Tesseract☆32Updated last year
- PAWS: A Web Shell☆53Updated last week
- Named entity recognition for the legal domain☆42Updated 4 years ago
- Tools for TICCL☆14Updated 2 months ago
- WordNet-LMF formats☆24Updated 4 months ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Library of Congress coding standards☆31Updated last year
- QA-tool for scans with corresponding ALTO-files☆26Updated 2 years ago