MohammedBelkacem / corpus-kab
Villages, names and places
☆9 Updated 5 years ago
Alternatives and similar repositories for corpus-kab:
Users who are interested in corpus-kab are comparing it to the repositories listed below.
- Natural language processing for the Kabyle language ☆15 Updated 4 years ago
- Automatic Speech Recognition (ASR) - Kabyle ☆17 Updated 4 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index ☆41 Updated 4 months ago
- Tool to collect and review sentences for Common Voice ☆81 Updated last year
- Open information and community for machine translation ☆74 Updated last week
- Scraping Wikipedia for fair use sentences ☆53 Updated last year
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files ☆27 Updated 5 years ago
- These are lists for a variety of languages containing words that are distinctive to each language. ☆37 Updated 2 years ago
- free French treebank ☆32 Updated 8 years ago
- Bitextor generates translation memories from multilingual websites ☆292 Updated 4 months ago
- Listening-based language learning ☆54 Updated last year
- Crawler for linguistic corpora ☆205 Updated last year
- German Morphological Analyzer ☆47 Updated 3 years ago
- Linguistic processing for Common Voice ☆55 Updated last year
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies ☆15 Updated 5 years ago
- The code, training pipeline, and models that power Firefox Translations ☆186 Updated this week
- The Open Multilingual Wordnet ☆61 Updated 10 months ago
- CoNLL 2018 Shared Task Team UDPipe-Future ☆39 Updated 4 years ago
- Unitex/GramLab Language Resources ☆19 Updated 2 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆191 Updated 4 years ago
- Python framework for processing Universal Dependencies data ☆55 Updated this week
- ElixirFM Functional Arabic Morphology ☆43 Updated 2 years ago
- Efficient teacher-student models and scripts to make them ☆50 Updated last year
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training. ☆161 Updated 3 years ago
- Corpus preprocessing ☆95 Updated last year
- ☆18 Updated 8 years ago
- A living document for all things Common Voice. ☆14 Updated 9 months ago
- Lexical data at Unicode ☆68 Updated 7 months ago
- Democratizing NLP! ☆104 Updated last year