salesforce / summary-of-a-haystackLinks
Codebase accompanying the Summary of a Haystack paper.
☆79Updated 10 months ago
Alternatives and similar repositories for summary-of-a-haystack
Users that are interested in summary-of-a-haystack are comparing it to the libraries listed below
Sorting:
- ☆124Updated 10 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆92Updated 8 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 5 months ago
- Verifiers for LLM Reinforcement Learning☆67Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 9 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 11 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆66Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 5 months ago
- ☆53Updated 8 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆114Updated 10 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆55Updated 10 months ago
- ☆70Updated 2 weeks ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 10 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆97Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆140Updated 8 months ago
- ☆41Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆105Updated 7 months ago
- ☆57Updated 10 months ago
- ☆77Updated 6 months ago
- ☆152Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset☆54Updated 2 weeks ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆187Updated last month
- Evaluating LLMs with fewer examples☆160Updated last year
- ☆145Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆107Updated 10 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆151Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆114Updated 3 weeks ago