elianap / divexplorer
☆11 · Updated 2 years ago
Alternatives and similar repositories for divexplorer:
Users interested in divexplorer are comparing it to the repositories listed below.
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 · Updated 2 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆48 · Updated 2 years ago
- Do Multilingual Language Models Think Better in English? ☆41 · Updated last year
- RATransformers 🐭 - Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware! ☆41 · Updated 2 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆80 · Updated 7 months ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer ☆55 · Updated 2 years ago
- Pre-training BART model for the Italian Language ☆15 · Updated 2 years ago
- ☆72 · Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆57 · Updated 10 months ago
- Official code for Wav2Seq ☆96 · Updated 2 years ago
- A PyTorch implementation of the paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021 ☆47 · Updated 3 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022) ☆30 · Updated 3 years ago
- PyTorch reimplementation of REALM and ORQA ☆22 · Updated 3 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆37 · Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM. ☆41 · Updated 7 months ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. ☆82 · Updated 3 years ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers ☆21 · Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ☆26 · Updated last year
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings. ☆71 · Updated 8 months ago
- ☆72 · Updated 11 months ago
- Measuring the Mixing of Contextual Information in the Transformer ☆29 · Updated last year
- ☆128 · Updated 2 years ago
- ☆38 · Updated last year
- My explorations into editing the knowledge and memories of an attention network ☆34 · Updated 2 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model. ☆11 · Updated 3 years ago
- ITALIC: An ITALian Intent Classification Dataset ☆12 · Updated last year
- ☆51 · Updated last year
- Randomized Positional Encodings Boost Length Generalization of Transformers ☆80 · Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023. ☆75 · Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆108 · Updated last year