openlanguagedata / awesome-new-languages-in-machine-translationLinks
A list of initiatives for adding new languages to opensource machine translation models
☆20Updated this week
Alternatives and similar repositories for awesome-new-languages-in-machine-translation
Users that are interested in awesome-new-languages-in-machine-translation are comparing it to the libraries listed below
Sorting:
- ☆13Updated 2 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Updated 2 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆157Updated 7 months ago
- T5-based (russian) text normalization☆22Updated last year
- Fine-tuned Multilingual BERT and Multilingual USE for sentiment analysis in Russian. RuReviews, RuSentiment, Kaggle Russian News Dataset,…☆50Updated 4 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆17Updated 5 months ago
- Здесь собирается каталог ссылок на полезные языковые ресурсы башкирского языка☆14Updated last year
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆58Updated 4 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆23Updated 3 years ago
- Russian Corpus of Linguistic Acceptability☆44Updated 10 months ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆23Updated 5 years ago
- Top ML papers of the week.☆38Updated this week
- Простой нормализатор текстов перед синтезом речи☆33Updated last year
- Train punctuation and capitalization models for different languages☆25Updated 3 years ago
- ☆26Updated 4 months ago
- Question answering on russian with XLMRobertaLarge as a service☆21Updated 3 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆61Updated 9 months ago
- python package russtress accentuates russian text☆56Updated 5 years ago
- ☆58Updated last year
- Augmentex — a library for augmenting texts with errors☆65Updated last year
- This project is concerned with my participating in the RuNNE competition https://github.com/dialogue-evaluation/RuNNE☆12Updated 2 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆13Updated last year
- Extracts parallel corpora from the 2 raw texts in different languages.☆36Updated 2 years ago
- Сайт проекта☆18Updated 11 months ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆68Updated 2 years ago
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆35Updated 3 years ago
- Нейронная сеть для восстановления пунктуации на русском языке.☆20Updated 3 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆37Updated 11 months ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆49Updated 4 months ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated 10 months ago