santhoshtr / sfst
Stuttgart Finite State Transducer system
☆18Updated 4 months ago
Alternatives and similar repositories for sfst:
Users that are interested in sfst are comparing it to the libraries listed below
- Lexical data at Unicode☆67Updated 6 months ago
- A LibreOffice extension that converts JabRef references to plain text code and vice versa so that you can use your references with MS Off…☆11Updated 6 months ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated 2 weeks ago
- Python Unicode Block Utilities☆24Updated 4 years ago
- WordNet-LMF formats☆21Updated 2 weeks ago
- tesseractXplore a tesseract ease of use gui with full control☆22Updated 3 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆13Updated 5 years ago
- User contributed (non Google) OCR models for Tesseract☆24Updated 4 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- Faster, modernized fork of the language identification tool langid.py☆54Updated 3 months ago
- Measure the similarity of text corpora for 74 languages☆13Updated last year
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Updated 6 months ago
- universal tokenizer☆15Updated 3 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆62Updated 9 months ago
- Loadable spellfix1 extension for sqlite as python package☆26Updated 10 months ago
- rasactl deploys Rasa X / Enterprise on your local or remote Kubernetes cluster and manages Rasa X / Enterprise deployments.☆15Updated 2 years ago
- Tools for TICCL☆14Updated 2 months ago
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…☆18Updated 3 months ago
- Indri search implementation on top of Lucene search engine☆34Updated 11 months ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Updated last year
- search interface for scholarly works☆84Updated 7 months ago
- Authoring tool for interactive content☆17Updated this week
- A sentence segmentation library with wide language support optimized for speed and utility.☆58Updated 6 months ago
- Web front end for WikDict dictionaries☆17Updated 3 weeks ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 6 years ago
- 🌸 Train floret vectors☆18Updated last year
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13Updated 5 months ago
- Natural Language Inflection in English☆11Updated 3 years ago