MadryLab / context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
☆228Updated 6 months ago
Alternatives and similar repositories for context-cite:
Users that are interested in context-cite are comparing it to the libraries listed below
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆233Updated 2 months ago
- awesome synthetic (text) datasets☆272Updated 5 months ago
- ☆117Updated 7 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆106Updated 7 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆104Updated 6 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆196Updated last week
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆170Updated 4 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆205Updated 5 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆163Updated 7 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆148Updated 3 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆83Updated 4 months ago
- A simple unified framework for evaluating LLMs☆209Updated last week
- Evaluating LLMs with fewer examples☆151Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆211Updated 5 months ago
- Verdict is a library for scaling judge-time compute.☆197Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆271Updated this week
- code for training & evaluating Contextual Document Embedding models☆180Updated this week
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.☆79Updated 3 months ago
- ☆166Updated this week
- The official evaluation suite and dynamic data release for MixEval.☆235Updated 5 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆197Updated last week
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- LOFT: A 1 Million+ Token Long-Context Benchmark☆187Updated 2 weeks ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆230Updated 7 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆169Updated 2 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆155Updated last year
- ☆120Updated 6 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆187Updated 4 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆168Updated last month
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆133Updated 5 months ago