konstantinjdobler / focusLinks

[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"

☆34

Alternatives and similar repositories for focus

Users that are interested in focus are comparing it to the libraries listed below

Sorting:

CPJKU / wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
☆84Updated last year
malteos / clp-transfer
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
☆30Updated 2 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆95Updated 2 years ago
google-research / metricx
☆112Updated 10 months ago
bigscience-workshop / multilingual-modeling
BLOOM+1: Adapting BLOOM model to support a new unseen language
☆73Updated last year
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆138Updated 8 months ago
google-research / t5x_retrieval
☆101Updated 2 years ago
ltgoslo / ltg-bert
LTG-Bert
☆34Updated last year
huggingface / that_is_good_data
☆65Updated 2 years ago
cisnlp / Glot500
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
☆104Updated last year
martiansideofthemoon / rankgen
Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…
☆138Updated 2 years ago
google-research / mt-metrics-eval
Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
☆116Updated 6 months ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58Updated 3 years ago
ZurichNLP / mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
☆60Updated last year
ltgoslo / gpt-bert
Official implementation of "GPT or BERT: why not both?"
☆60Updated 2 months ago
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆22Updated 3 months ago
bigscience-workshop / data_tooling
Tools for managing datasets for governance and training.
☆85Updated 2 weeks ago
sileod / tasknet
Easy modernBERT fine-tuning and multi-task learning
☆61Updated 3 months ago
catie-aq / flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
☆107Updated 6 months ago
jaketae / ensemble-transformers
Ensembling Hugging Face transformers made easy
☆63Updated 2 years ago
cisnlp / ofa
A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining
☆19Updated last year
juletx / self-translate
Do Multilingual Language Models Think Better in English?
☆42Updated 2 years ago
nyu-mll / SQuALITY
Query-focused summarization data
☆42Updated 2 years ago
asahi417 / lm-vocab-trimmer
Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…
☆54Updated 11 months ago
zouharvi / tokenization-scorer
Simple-to-use scoring function for arbitrarily tokenized texts.
☆46Updated 7 months ago
cisnlp / GlotLID
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
☆162Updated 4 months ago
MicrosoftTranslator / NTREX
NTREX -- News Test References for MT Evaluation
☆86Updated last year
shayne-longpre / a-pretrainers-guide
☆72Updated 2 years ago
castorini / mr.tydi
Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.
☆79Updated 3 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆27Updated last year