santhoshtr / sfstLinks
Stuttgart Finite State Transducer system
☆23Updated 4 months ago
Alternatives and similar repositories for sfst
Users that are interested in sfst are comparing it to the libraries listed below
Sorting:
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated this week
- Lexical data at Unicode☆70Updated last year
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13Updated 7 months ago
- Tools for TICCL☆14Updated this week
- Crop And Splice Segments (of scanned pages)☆14Updated 6 years ago
- ☆17Updated 2 weeks ago
- OCRopus model for Gothic print (Fraktur)☆19Updated 5 years ago
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆21Updated 6 months ago
- In-browser OCR of Ancient Greek and Latin☆26Updated 2 months ago
- Named entity annotation tool☆28Updated 2 years ago
- Flask Interface to Thompson's Motif Index☆18Updated 6 years ago
- tesseractXplore a tesseract ease of use gui with full control☆24Updated 4 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- Glyph Miner, a system for extracting glyphs from early typeset prints☆34Updated 9 years ago
- Clone of https://gitlab.com/scripta/escriptorium.git☆30Updated last month
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆56Updated 4 years ago
- QA-tool for scans with corresponding ALTO-files☆26Updated 3 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆58Updated 2 months ago
- WordNet-LMF formats☆24Updated 2 weeks ago
- Ergonomic line-by-line transcription of scanned text.☆54Updated 4 years ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆73Updated 3 weeks ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 10 months ago
- Link Wikidata items to large catalogs☆96Updated last month
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆18Updated 2 months ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Python tools for interacting with Wikidata☆158Updated 2 years ago
- View HOCR files with Mirador☆29Updated 8 years ago
- 🌸 Train floret vectors☆18Updated 2 years ago
- Named entity recognition for the legal domain☆42Updated 4 years ago