0x7o / RETRO-transformer
Easy-to-use Retrieval-Enhanced Transformer implementation
☆11 Updated 2 years ago
Alternatives and similar repositories for RETRO-transformer
Users who are interested in RETRO-transformer compare it to the libraries listed below.
- ☆177 Updated 2 years ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆78 Updated last year
- ☆69 Updated last year
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval ☆51 Updated 11 months ago
- Retrieval as Attention ☆83 Updated 2 years ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆50 Updated 7 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆142 Updated 7 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆89 Updated 6 months ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆128 Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆44 Updated 11 months ago
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen… ☆61 Updated 2 years ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆133 Updated 2 weeks ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆101 Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆21 Updated 9 months ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution ☆31 Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆94 Updated 2 years ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆142 Updated 5 months ago
- Official codebase for permutation self-consistency. ☆18 Updated last year
- ☆31 Updated 7 months ago
- A toolkit for building dense retrievers with deep language models. ☆60 Updated 3 years ago
- Unofficial implementation of AlpaGasus ☆91 Updated last year
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" ☆76 Updated 6 months ago
- ☆68 Updated 2 years ago
- ☆23 Updated 3 weeks ago
- Code for Zero-Shot Tokenizer Transfer ☆128 Updated 4 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆29 Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆48 Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation ☆52 Updated last month
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆89 Updated 7 months ago
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆162 Updated last month