Word Embeddings for Low Resource Languages: The Case of Buryat
☆10Mar 12, 2025Updated last year
Alternatives and similar repositories for burvec
Users that are interested in burvec are comparing it to the libraries listed below.
- Morphological analysis for Udmurt.☆12May 23, 2026Updated 2 weeks ago
- Armenian alphabet trainer☆13May 10, 2022Updated 4 years ago
- Evaluation tools for the RUSSE evaluation campaign.☆37Jun 11, 2017Updated 9 years ago
- Tools for handling GRNTI list☆10Sep 2, 2023Updated 2 years ago
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google…☆18Jun 27, 2025Updated 11 months ago
- The Participant's Kit for RUSSE 2018 WSI&D Shared Task☆12Nov 23, 2022Updated 3 years ago
- Corset is a web-based data selection portal that helps you get relevant data from massive amounts of parallel data.☆21Nov 6, 2023Updated 2 years ago
- 🌻 MediaWiki extension allowing mass recording of clean, well cut, well named pronunciation files.☆17Jun 4, 2026Updated last week
- Miscellaneous projects, too small to warrant their own GitHub projects, in their own subdirectories here.☆14Aug 15, 2020Updated 5 years ago
- A list of initiatives for adding new languages to open-source machine translation models☆22Dec 2, 2025Updated 6 months ago
- ☆29Jan 13, 2026Updated 4 months ago
- TextoKit is a set of components for Natural Language Processing based on the Apache UIMA platform.☆16Jul 6, 2016Updated 9 years ago
- Tools for assessing Finnish poetry: rhymes, meter, hyphenation of Finnish and so on.☆13Dec 13, 2023Updated 2 years ago
- This is the official implementation of NeurIPS 2021 "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Ret…☆71Apr 1, 2022Updated 4 years ago
- Backend web application and API for the Estonian ASR pipeline.☆19Jul 11, 2025Updated 11 months ago
- Toxic Comments Detection in Russian.☆29Feb 19, 2021Updated 5 years ago
- Sample proposals for applying to Open.TLab☆27Dec 15, 2022Updated 3 years ago
- ☆19Oct 14, 2021Updated 4 years ago
- Dead project -- feel free to fork and update!☆30Mar 26, 2015Updated 11 years ago
- Comparing quality and performance of NLP systems for Russian language☆50Jul 24, 2023Updated 2 years ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 3 years ago
- ☆50Nov 20, 2017Updated 8 years ago
- Concept dictionary☆41Apr 4, 2024Updated 2 years ago
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Apr 8, 2023Updated 3 years ago
- Web version of A. A. Zaliznyak's "Grammatical Dictionary"☆22Jan 7, 2026Updated 5 months ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆92May 27, 2023Updated 3 years ago
- ☆11Dec 9, 2020Updated 5 years ago
- Implementation of a simple BPE tokenizer, but in Nim☆22Jul 2, 2023Updated 2 years ago
- Wikipedia-based Explicit Semantic Analysis, as described by Gabrilovich and Markovitch☆36May 13, 2020Updated 6 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Nov 24, 2014Updated 11 years ago
- Tools for fuzzy string search in text and dictionaries written in Java☆10Dec 24, 2015Updated 10 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- fast trainer for educational purposes☆26Jun 4, 2026Updated last week
- Automatic speech transcription and speaker identification pipeline based on Kaldi and Nextflow☆33Jun 3, 2026Updated last week
- Repository for the course "Programming and Computer Tools for Linguistic Research", 2016-2017 academic year.☆21May 22, 2017Updated 9 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 8 years ago
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 5 years ago
- ☆29Jun 27, 2019Updated 6 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆21Jan 17, 2020Updated 6 years ago