☆53Jun 6, 2023Updated 2 years ago
Alternatives and similar repositories for xtreme-up
Users that are interested in xtreme-up are comparing it to the libraries listed below
Sorting:
- ☆18Nov 25, 2022Updated 3 years ago
- “Data Augmentation for Cross-Domain Named Entity Recognition” (EMNLP 2021)☆20Apr 4, 2022Updated 3 years ago
- Evaluation framework for open-domain question answering.☆20May 16, 2021Updated 4 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 4 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- ☆13Sep 25, 2024Updated last year
- ☆13Sep 2, 2021Updated 4 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- ☆10Apr 17, 2024Updated last year
- decontamination☆26Dec 3, 2025Updated 3 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- A Grapheme to Phoneme model using LSTM implemented in pytorch☆13Jul 6, 2022Updated 3 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆27Oct 24, 2022Updated 3 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- ☆15Apr 12, 2023Updated 2 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- ☆17Oct 18, 2023Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15May 16, 2025Updated 9 months ago
- ☆19Aug 10, 2024Updated last year
- Transfer Learning in Dialogue Benchmarking Toolkit☆14Mar 31, 2023Updated 2 years ago
- UNLP 2024 Shared Task on LLM instruction-tuning for Ukrainian☆17Apr 15, 2024Updated last year
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 2 years ago
- DEMix Layers for Modular Language Modeling☆54Feb 25, 2026Updated last week
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- ParaNames: A multilingual resource for parallel names☆39May 20, 2024Updated last year
- ☆15Aug 14, 2018Updated 7 years ago
- LLM as a Chatbot Service☆17Aug 28, 2023Updated 2 years ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆18Apr 29, 2024Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Apr 20, 2024Updated last year
- Dense hybrid representations for text retrieval☆64Apr 3, 2023Updated 2 years ago
- Language models scale reliably with over-training and on downstream tasks☆100Apr 2, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- python-midi connection for controlling keyboard events on windows...☆15Mar 13, 2013Updated 12 years ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆20Nov 19, 2024Updated last year
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 2 years ago