anadim / the-little-retrieval-test
☆31 Updated last year
Related projects
Alternatives and complementary repositories for the-little-retrieval-test
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆69 Updated last year
- ☆71 Updated 6 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆23 Updated 11 months ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge" ☆73 Updated last year
- ☆38 Updated 7 months ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences". ☆68 Updated 10 months ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆56 Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆18 Updated 3 months ago
- Simple and efficient pytorch-native transformer training and inference (batched) ☆61 Updated 7 months ago
- Language models scale reliably with over-training and on downstream tasks ☆94 Updated 7 months ago
- DEMix Layers for Modular Language Modeling ☆53 Updated 3 years ago
- Long Context Extension and Generalization in LLMs ☆39 Updated 2 months ago
- ☆45 Updated 9 months ago
- ☆45 Updated 9 months ago
- ☆36 Updated 3 months ago
- ☆50 Updated 6 months ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023). ☆24 Updated 2 months ago
- ☆38 Updated 7 months ago
- A toolkit for scaling law research ⚖ ☆43 Updated 8 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆41 Updated last year
- ☆22 Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆58 Updated 3 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆40 Updated 4 months ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le… ☆85 Updated 3 years ago
- ☆80 Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting ☆30 Updated 2 years ago
- SILO Language Models code repository ☆80 Updated 8 months ago
- ☆26 Updated 8 months ago
- ☆93 Updated last year
- ☆46 Updated this week