MadryLab / context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
☆219Updated 5 months ago
Alternatives and similar repositories for context-cite:
Users that are interested in context-cite are comparing it to the libraries listed below
- awesome synthetic (text) datasets☆265Updated 4 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆229Updated last month
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆196Updated this week
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆131Updated 4 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆202Updated 4 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆161Updated 3 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆104Updated 6 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆102Updated 5 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆167Updated last month
- Evaluating LLMs with fewer examples☆147Updated 11 months ago
- AWM: Agent Workflow Memory☆252Updated last month
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆152Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆233Updated 4 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆227Updated 7 months ago
- ☆142Updated 11 months ago
- A simple unified framework for evaluating LLMs☆206Updated 2 weeks ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆160Updated 6 months ago
- ☆115Updated 7 months ago
- code for training & evaluating Contextual Document Embedding models☆176Updated 2 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆253Updated 8 months ago
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.☆76Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆207Updated 4 months ago
- LOFT: A 1 Million+ Token Long-Context Benchmark☆182Updated this week
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆147Updated 4 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆136Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆168Updated 2 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆187Updated this week
- Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".☆195Updated 5 months ago
- Comprehensive benchmark for RAG☆147Updated 4 months ago
- ☆119Updated 5 months ago