jzbjyb / ReAtt
Retrieval as Attention
☆83 Updated 2 years ago
Alternatives and similar repositories for ReAtt:
Users who are interested in ReAtt are comparing it to the libraries listed below.
- ConTextual Mask Auto-Encoder for Dense Passage Retrieval ☆35 Updated 4 months ago
- TBC ☆26 Updated 2 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge" ☆77 Updated last year
- ☆48 Updated 11 months ago
- ☆54 Updated 2 years ago
- DEMix Layers for Modular Language Modeling ☆53 Updated 3 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following ☆79 Updated 6 months ago
- Code for paper 'Data-Efficient FineTuning' ☆29 Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆98 Updated last year
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models ☆22 Updated 8 months ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning". ☆64 Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆64 Updated 2 years ago
- ACL'23: Unified Demonstration Retriever for In-Context Learning ☆36 Updated last year
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval ☆30 Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆42 Updated 8 months ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆43 Updated 5 months ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… ☆49 Updated 2 years ago
- The original Backpack Language Model implementation, a fork of FlashAttention ☆66 Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆20 Updated 7 months ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences". ☆69 Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆100 Updated 2 years ago
- ☆95 Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆30 Updated last year
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning" ☆34 Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆58 Updated 2 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024] ☆61 Updated 4 months ago
- This project maintains a reading list for general text generation tasks ☆65 Updated 3 years ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆45 Updated last year
- [WWW 2024] The official repo for paper "Scalable and Effective Generative Information Retrieval". ☆54 Updated 10 months ago