apertium / lttoolbox
Finite state compiler, processor and helper tools used by apertium
☆18 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for lttoolbox
- Python library to parse Apertium stream format ☆13 Updated last year
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies ☆14 Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- A lexicon compiler for non-suffixational morphologies ☆11 Updated 4 months ago
- Helsinki Finite-State Technology (library and application suite) ☆123 Updated this week
- Automatically exported from code.google.com/p/foma ☆117 Updated 4 months ago
- now you can even use apertium from python ☆31 Updated 9 months ago
- Transliteration module for Indian Languages ☆77 Updated last year
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi ☆17 Updated last year
- Python Finite-State Toolkit ☆45 Updated last week
- HFST optimized-lookup standalone library and command line tool ☆12 Updated 6 years ago
- Jupyter notebooks for course "Computational Morphology with HFST". ☆15 Updated 2 years ago
- A Python based API to access Indian language WordNets. ☆37 Updated 2 years ago
- eXternally configurable REference and Non Named Entity Recognizer ☆17 Updated 5 months ago
- ☆63 Updated 6 months ago
- Transform TMX to text ☆29 Updated last year
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with… ☆69 Updated this week
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kannada, Malayalam, Urdu, Bengali) ☆40 Updated last year
- The Open Multilingual Wordnet ☆58 Updated 6 months ago
- eXtensible Interlinear Glossed Text ☆31 Updated 2 years ago
- These are lists for a variety of languages containing words that are distinctive to each language. ☆34 Updated 2 years ago
- A cloud-based, open-source system for writing and publishing dictionaries. ☆86 Updated 10 months ago
- A multilingual linked idioms data set. ☆17 Updated 6 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆150 Updated 5 months ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆149 Updated last year
- Automatically exported from code.google.com/p/hunpos ☆11 Updated 6 years ago
- Collaborative data curation for Glottolog ☆152 Updated this week
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span… ☆70 Updated this week
- Supervised learning of morphology ☆28 Updated 7 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files ☆27 Updated 5 years ago