openlanguagedata / awesome-new-languages-in-machine-translationView external linksLinks
A list of initiatives for adding new languages to opensource machine translation models
☆21Dec 2, 2025Updated 2 months ago
Alternatives and similar repositories for awesome-new-languages-in-machine-translation
Users that are interested in awesome-new-languages-in-machine-translation are comparing it to the libraries listed below
Sorting:
- ☆13Dec 7, 2022Updated 3 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 3 years ago
- King's Bounty 3 (extended JavaScript fan remake of original 1990 game)☆16Mar 2, 2024Updated last year
- ☆12May 18, 2022Updated 3 years ago
- Top ML papers of the week.☆45Updated this week
- ☆13Sep 12, 2024Updated last year
- Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟☆25Aug 4, 2025Updated 6 months ago
- Data package for the data sets from the book "A Handbook of Small Data Sets" by David Hand (1994)☆16Dec 13, 2024Updated last year
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs☆19Feb 8, 2026Updated last week
- 1st place (public LB) solution of AIJ2020 Sberbank competition (Digital Peter)☆18Nov 22, 2020Updated 5 years ago
- Сайт проекта☆18Aug 25, 2024Updated last year
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Jul 21, 2023Updated 2 years ago
- ML Course created for Bauman Moscow State Technical University☆65Aug 31, 2022Updated 3 years ago
- 📖 React frontend to parabible.com (cf. https://github.com/parabible/parabible-server): an intuitive way to do serious Bible study in ori…☆22Mar 4, 2023Updated 2 years ago
- AI-generated text boundary detection with RoFT☆25Sep 9, 2024Updated last year
- Framework for probing tasks☆30Mar 24, 2024Updated last year
- ☆30Dec 6, 2021Updated 4 years ago
- Tools for shrinking fastText models (in gensim format)☆182May 3, 2024Updated last year
- Reinforcement Learning Library.☆29Aug 16, 2022Updated 3 years ago
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆28Mar 16, 2023Updated 2 years ago
- Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts☆10Oct 11, 2020Updated 5 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆11Apr 5, 2022Updated 3 years ago
- Lingtrain Aligner — ML powered library for the accurate texts alignment.☆148Jun 27, 2025Updated 7 months ago
- Диалоговая система на базе FRED-T5☆37Jul 10, 2023Updated 2 years ago
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.☆37Updated this week
- Unofficial implementation of paper "InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER" (https://arxiv.…☆38Feb 14, 2024Updated 2 years ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆35Jul 16, 2022Updated 3 years ago
- Russian Law as Open Data☆48Feb 5, 2026Updated last week
- Public releases of ACAI data☆12Nov 6, 2025Updated 3 months ago
- This is a python toolkit and developer version package to estimate multidimensional aspects of greenness and nature exposure, such as ava…☆12Aug 27, 2023Updated 2 years ago
- Data profiling tools for Big Data☆11Nov 17, 2025Updated 3 months ago
- SpaCy official Russian model proposal☆32Jan 24, 2021Updated 5 years ago
- ☆22Jun 10, 2025Updated 8 months ago
- GPT prepping certificates of translation☆11Jan 27, 2024Updated 2 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆164Dec 8, 2025Updated 2 months ago
- ccrd2024.github.io.☆11Mar 9, 2024Updated last year
- ☆36Aug 25, 2022Updated 3 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- Simple repo to finetune an LLM hosted on Hugging Face by creating a LORA☆11Dec 20, 2023Updated 2 years ago