elianap / divexplorer
☆11Updated 3 years ago
Alternatives and similar repositories for divexplorer
Users that are interested in divexplorer are comparing it to the libraries listed below
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆82Updated 10 months ago
- ☆72Updated 2 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆31Updated 2 years ago
- ☆20Updated 2 years ago
- ☆129Updated 2 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆25Updated last month
- Code for EMNLP 2023 findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation"☆18Updated last year
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆74Updated 11 months ago
- Pre-training BART model for the Italian Language☆16Updated 2 years ago
- PyTorch reimplementation of REALM and ORQA☆22Updated 3 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.☆82Updated 3 years ago
- Randomized Positional Encodings Boost Length Generalization of Transformers☆82Updated last year
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper: https://arxiv.org/abs/2212.01349)☆157Updated 2 years ago
- The original Backpack Language Model implementation, a fork of FlashAttention☆69Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆163Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆73Updated last year
- A curated list of research papers and resources on Cultural LLM.☆45Updated 9 months ago
- ☆38Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- babyLM WhisBERT code☆20Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023.☆76Updated last year
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119Updated 2 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆49Updated 3 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆45Updated 2 years ago
- This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.1…☆132Updated last year
- This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models".☆48Updated 3 years ago
- Research code for pixel-based encoders of language (PIXEL)☆337Updated this week
- Code for Zero-Shot Tokenizer Transfer☆133Updated 6 months ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆112Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts☆24Updated 2 years ago