santhoshtr / sfstLinks
Stuttgart Finite State Transducer system
☆23Updated 5 months ago
Alternatives and similar repositories for sfst
Users that are interested in sfst are comparing it to the libraries listed below
Sorting:
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated last month
- Lexical data at Unicode☆70Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆18Updated last week
- Tools for TICCL☆14Updated last month
- Wikidata authority file mapping tool☆11Updated 7 years ago
- search interface for scholarly works☆85Updated last year
- 🌸 Train floret vectors☆18Updated 2 years ago
- Python tools for interacting with Wikidata☆160Updated 2 years ago
- Glyph Miner, a system for extracting glyphs from early typeset prints☆34Updated 9 years ago
- Tools for working with book data☆18Updated last month
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- Flask Interface to Thompson's Motif Index☆18Updated 6 years ago
- Crop And Splice Segments (of scanned pages)☆14Updated 6 years ago
- ☆17Updated 2 weeks ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆69Updated 2 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆36Updated 2 years ago
- 🔍 Mirror of https://gerrit.wikimedia.org/g/mediawiki/extensions/CirrusSearch. See https://www.mediawiki.org/wiki/Developer_access for co…☆45Updated this week
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆22Updated last month
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 6 years ago
- Demonstration of searching PDF document with Solr, Tika, and Tesseract☆32Updated last year
- PAWS: A Web Shell☆53Updated last week
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆24Updated last year
- Named entity annotation tool☆28Updated 2 years ago
- tesseractXplore a tesseract ease of use gui with full control☆24Updated 4 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated 2 years ago
- Repo for the Wikimedia Listeria bot☆27Updated 3 weeks ago
- WordNet-LMF formats☆24Updated 2 months ago
- An index data structure for approximate string search.☆23Updated 6 years ago