aymeric-roucher / LongContext_vs_RAG_NeedleInAHaystackLinks
Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths
☆35Updated last year
Alternatives and similar repositories for LongContext_vs_RAG_NeedleInAHaystack
Users that are interested in LongContext_vs_RAG_NeedleInAHaystack are comparing it to the libraries listed below
Sorting:
- ☆48Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- A framework for evaluating function calls made by LLMs☆37Updated 10 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆103Updated 5 months ago
- Track the progress of LLM context utilisation☆53Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- QLoRA for Masked Language Modeling☆22Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆48Updated last year
- ☆29Updated 6 months ago
- Exploring limitations of LLM-as-a-judge☆18Updated 9 months ago
- ☆41Updated 11 months ago
- ☆49Updated 6 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆43Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆55Updated 2 weeks ago
- Analysis on the cost of encoder based models☆11Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- ☆23Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 3 months ago
- ☆53Updated 5 months ago
- ☆43Updated 3 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆40Updated last year
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 5 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 6 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆45Updated last month
- ☆37Updated 2 years ago
- The first dense retrieval model that can be prompted like an LM☆73Updated 3 weeks ago
- Simple GRPO scripts and configurations.☆58Updated 3 months ago