chapayevdauren / kazakh-language-corpusLinks
Open Source Kazakh Corpus
☆20Updated 2 years ago
Alternatives and similar repositories for kazakh-language-corpus
Users that are interested in kazakh-language-corpus are comparing it to the libraries listed below
Sorting:
- Apertium linguistic data for Kazakh☆21Updated 2 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆104Updated 4 years ago
- Comparing quality and performance of NLP systems for Russian language☆49Updated 2 years ago
- Deep Learning based NLP modeling for Russian language☆240Updated 2 years ago
- Rule-based token, sentence segmentation for Russian language☆276Updated 2 years ago
- A Python wrapper for the RuWordNet thesaurus☆72Updated last year
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆156Updated last year
- Large silver standart Russian corpus with NER, morphology and syntax markup☆71Updated 2 years ago
- Sentiment analysis library for russian language☆320Updated 2 years ago
- Russian data from the SynTagRus corpus.☆86Updated 2 months ago
- Russian language models for spaCy☆241Updated 4 years ago
- Russian SuperGLUE benchmark☆111Updated 2 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated last year
- nlp workshop at datafest siberia 2019☆22Updated 3 years ago
- Python text speller☆40Updated last year
- ☆86Updated 3 years ago
- NLP tools for Kazakh language☆35Updated 3 years ago
- Links to Russian corpora + Python functions for loading and parsing☆309Updated 2 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆123Updated 4 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Updated 7 years ago
- Accentor and transcriptor for Russian language☆132Updated 3 years ago
- 🔬 Очистка датасетов от мусора (нормализация, препроцессинг)☆41Updated 4 years ago
- SpaCy official Russian model proposal☆32Updated 5 years ago
- ANYKS Spell-Checker☆19Updated 3 years ago
- Simple python lib to tokenize texts into sentences and sentences to words. Small, fast and robust. Comes with ukrainian flavour☆61Updated 2 years ago
- Topic modeling with BigARTM: an interactive book☆59Updated 7 years ago
- ☆33Updated 8 years ago
- python package russtress accentuates russian text☆62Updated 5 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆144Updated 3 years ago
- ☆33Updated 6 years ago