VodLM / vod
End-to-end training of Retrieval-Augmented LMs (REALM, RAG)
☆22 · Updated 2 years ago
Alternatives and similar repositories for vod
Users who are interested in vod are comparing it to the libraries listed below.
- Embedding Recycling for Language models ☆38 · Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆49 · Updated 2 years ago
- Few-shot Learning with Auxiliary Data ☆31 · Updated last year
- A Toolkit for Distributional Control of Generative Models ☆73 · Updated 4 months ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022). ☆26 · Updated 2 years ago
- Download, parse, and filter data from PubMed, data-ready for The-Pile ☆23 · Updated 3 years ago
- Code for the paper "Query-Key Normalization for Transformers" ☆49 · Updated 4 years ago
- Transformers at any scale ☆41 · Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea… ☆76 · Updated last year
- Retrieval as Attention ☆82 · Updated 2 years ago
- Google Research ☆46 · Updated 3 years ago
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 · Updated 2 years ago
- This repository contains the code for the perspective paper "Multimodal Neural Databases" accepted at SIGIR 2023. ☆19 · Updated last year
- Adding new tasks to T0 without catastrophic forgetting ☆33 · Updated 3 years ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings. ☆75 · Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆27 · Updated 2 years ago
- Pretraining Efficiently on S2ORC! ☆173 · Updated last year
- Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?" ☆33 · Updated 10 months ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆59 · Updated 2 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023 ☆138 · Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 · Updated 9 months ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆138 · Updated 2 years ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE … ☆115 · Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆22 · Updated 5 months ago
- A repository for transformer critique learning and generation ☆89 · Updated last year
- SILO Language Models code repository ☆83 · Updated last year