0x7o / RETRO-transformer
Easy-to-use Retrieval-Enhanced Transformer implementation
☆10 · Updated 3 years ago
Alternatives and similar repositories for RETRO-transformer
Users that are interested in RETRO-transformer are comparing it to the libraries listed below
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆81 · Updated 2 years ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆51 · Updated last week
- Retrieval as Attention ☆82 · Updated 3 years ago
- ☆145 · Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆49 · Updated last year
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆136 · Updated last year
- ☆105 · Updated 2 years ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Length (ICLR 2024) ☆204 · Updated last year
- Code for Zero-Shot Tokenizer Transfer ☆142 · Updated last year
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts ☆324 · Updated last year
- PASTA: Post-hoc Attention Steering for LLMs ☆132 · Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers" ☆40 · Updated 9 months ago
- Official repo for "On the Generalization Ability of Retrieval-Enhanced Transformers" ☆46 · Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆222 · Updated last month
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆23 · Updated last year
- ☆51 · Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ☆88 · Updated last year
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance… ☆157 · Updated 9 months ago
- Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering ☆37 · Updated 2 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆126 · Updated last year
- ☆188 · Updated 6 months ago
- Dataset collection and preprocessing framework for NLP extreme multitask learning ☆190 · Updated 6 months ago
- The HELMET Benchmark ☆197 · Updated last month
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆165 · Updated 2 years ago
- Code for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings) ☆46 · Updated 2 years ago
- Easy control for Key-Value Constrained Generative LLM Inference (https://arxiv.org/abs/2402.06262) ☆63 · Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆145 · Updated last year
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach. ☆236 · Updated 4 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆105 · Updated 2 years ago
- LOFT: A 1 Million+ Token Long-Context Benchmark ☆223 · Updated 7 months ago