Word Embeddings for Low Resource Languages: The Case of Buryat
☆10Mar 12, 2025Updated last year
Alternatives and similar repositories for burvec
Users that are interested in burvec are comparing it to the libraries listed below
Sorting:
- Armenian alphabet trainer☆12May 10, 2022Updated 3 years ago
- Morphological analysis for Udmurt.☆12Feb 17, 2026Updated last month
- Evaluation tools for the RUSSE evaluation campaign.☆37Jun 11, 2017Updated 8 years ago
- Tools for handling GRNTI list☆10Sep 2, 2023Updated 2 years ago
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google…☆18Jun 27, 2025Updated 8 months ago
- Corset is a web-based data selection portal that helps you getting relevant data from massive amounts of parallel data.☆21Nov 6, 2023Updated 2 years ago
- The Participant's Kit for RUSSE 2018 WSI&D Shared Task☆12Nov 23, 2022Updated 3 years ago
- 🌻 MediaWiki extension allowing mass recording of clean, well cut, well named pronunciation files.☆17Updated this week
- Miscellaneous projects, too small to warrant their own github projects, in their own subdirectories here.☆14Aug 15, 2020Updated 5 years ago
- A list of initiatives for adding new languages to opensource machine translation models☆21Dec 2, 2025Updated 3 months ago
- ☆28Jan 13, 2026Updated 2 months ago
- TextoKit - is a set of components for Natural Language Processing based on Apache UIMA platform.☆16Jul 6, 2016Updated 9 years ago
- Backend web application and API for the Estonian ASR pipeline.☆18Jul 11, 2025Updated 8 months ago
- This is the official implementation of NeurIPS 2021 "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Ret…☆71Apr 1, 2022Updated 3 years ago
- Toxic Comments Detection in Russian.☆29Feb 19, 2021Updated 5 years ago
- Tools for assessing Finnish poetry: rhymes, meter, hyphenation of Finnish and so on.☆13Dec 13, 2023Updated 2 years ago
- Примеры пропозалов для подачи заявки в Open.TLab☆27Dec 15, 2022Updated 3 years ago
- ☆19Oct 14, 2021Updated 4 years ago
- Dead project -- feel free to fork and update!☆30Mar 26, 2015Updated 10 years ago
- Comparing quality and performance of NLP systems for Russian language☆51Jul 24, 2023Updated 2 years ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- ☆51Nov 20, 2017Updated 8 years ago
- Веб-версия "Грамматического словаря" А. А. Зализняка☆21Jan 7, 2026Updated 2 months ago
- Concept dictionary☆39Apr 4, 2024Updated last year
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Apr 8, 2023Updated 2 years ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆93May 27, 2023Updated 2 years ago
- ☆11Dec 9, 2020Updated 5 years ago
- Wikipedia-based Explicit Semantic Analysis, as described by Gabrilovich and Markovitch☆36May 13, 2020Updated 5 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Nov 24, 2014Updated 11 years ago
- Tools for fuzzy string search in text and dictionaries written in Java☆10Dec 24, 2015Updated 10 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- ☆28Jun 27, 2019Updated 6 years ago
- fast trainer for educational purposes☆24Mar 12, 2026Updated last week
- Automatic speech transcription and speaker identification pipeline based on Kaldi and Nextflow☆32Nov 7, 2025Updated 4 months ago
- Репоззиторий для курса "Программирование и компьютерные инструменты лингвистических исследований" в 2016-2017 уч. году.☆21May 22, 2017Updated 8 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 7 years ago
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 4 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆21Jan 17, 2020Updated 6 years ago
- RUSSE: Russian Semantic Evaluation.☆16Mar 1, 2022Updated 4 years ago