Guitaricet / my_pefty_llama
Minimal implementation of multiple PEFT methods for LLaMA fine-tuning
☆13Updated last year
Alternatives and similar repositories for my_pefty_llama:
Users that are interested in my_pefty_llama are comparing it to the libraries listed below
- Embedding Recycling for Language models☆38Updated last year
- ☆44Updated 4 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119Updated 2 years ago
- ☆38Updated 11 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆47Updated last year
- ☆73Updated 11 months ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Transformers at any scale☆41Updated last year
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆70Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆31Updated last year
- ☆54Updated 2 years ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆38Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated 2 years ago
- Truly flash T5 realization!☆64Updated 10 months ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Updated last year
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆24Updated 2 years ago
- My explorations into editing the knowledge and memories of an attention network☆34Updated 2 years ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆27Updated last month
- ☆25Updated last month
- SILO Language Models code repository☆81Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆56Updated 8 months ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts☆23Updated last year
- The original Backpack Language Model implementation, a fork of FlashAttention☆66Updated last year
- Efficient Memory-Augmented Transformers☆34Updated 2 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated 9 months ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆71Updated 7 months ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆71Updated last year