MadryLab / context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
☆268 Updated 9 months ago
Alternatives and similar repositories for context-cite
Users who are interested in context-cite compare it with the libraries listed below.
- awesome synthetic (text) datasets ☆291 Updated 3 weeks ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆115 Updated 10 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024] ☆174 Updated 7 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆207 Updated 2 months ago
- Code accompanying "How I learned to start worrying about prompt formatting". ☆107 Updated last month
- Evaluating LLMs with fewer examples ☆160 Updated last year
- ☆118 Updated 11 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆240 Updated 5 months ago
- Official repo for "Make Your LLM Fully Utilize the Context" ☆253 Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆159 Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆222 Updated last month
- Manage scalable open LLM inference endpoints in Slurm clusters ☆268 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated 10 months ago
- The first dense retrieval model that can be prompted like an LM ☆81 Updated 2 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆223 Updated this week
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343. ☆118 Updated 2 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆242 Updated 8 months ago
- Functional Benchmarks and the Reasoning Gap ☆88 Updated 10 months ago
- A small library of LLM judges ☆248 Updated this week
- A simple unified framework for evaluating LLMs ☆235 Updated 3 months ago
- ☆125 Updated 10 months ago
- Inference-time scaling for LLMs-as-a-judge. ☆267 Updated 3 weeks ago
- ☆152 Updated last year
- Complex Function Calling Benchmark. ☆123 Updated 6 months ago
- Synthetic Data for LLM Fine-Tuning ☆120 Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆87 Updated 10 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆182 Updated 5 months ago
- code for training & evaluating Contextual Document Embedding models ☆196 Updated 2 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users ☆233 Updated 9 months ago
- Official repository for paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆188 Updated last month