fdschmidt93 / trident-nllb-llm2vec
Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"
☆13Updated last month
Related projects ⓘ
Alternatives and complementary repositories for trident-nllb-llm2vec
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆11Updated 9 months ago
- GlotCC Dataset and Pipline -- NeurIPS 2024☆16Updated last week
- LTG-Bert☆29Updated 10 months ago
- Official implementation of "GPT or BERT: why not both?"☆12Updated this week
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- A tiny BERT for low-resource monolingual models☆29Updated last month
- A library for data streaming and augmentation☆20Updated 7 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆23Updated 6 months ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆19Updated 2 years ago
- ☆19Updated last year
- Collection of scripts from mHuBERT-147.☆22Updated 4 months ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 8 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆29Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆55Updated 5 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated this week
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆44Updated last year
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆23Updated 2 weeks ago
- A library for minimum Bayes risk (MBR) decoding☆29Updated 3 weeks ago
- ☆33Updated 3 years ago
- Experiments for XLM-V Transformers Integeration☆13Updated last year
- ☆15Updated last year
- Library for fast text representation and classification.☆28Updated 10 months ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆23Updated last year
- ☆19Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆22Updated 8 months ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆11Updated 4 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year