D-Star-AI / KITE
KITE (Knowledge-Intensive Task Evaluation) is an end-to-end benchmark for RAG pipelines
☆19 Updated 10 months ago
Alternatives and similar repositories for KITE
Users that are interested in KITE are comparing it to the libraries listed below
- Reasoning by Communicating with Agents ☆29 Updated last month
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated 8 months ago
- ☆16 Updated last year
- ☆20 Updated 2 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆20 Updated 6 months ago
- ☆62 Updated 11 months ago
- Query Expansion for Better Query Embedding using LLMs ☆52 Updated 4 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models ☆26 Updated last year
- ☆24 Updated 9 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆43 Updated last year
- ☆46 Updated 9 months ago
- ☆41 Updated 6 months ago
- LLMs as Collaboratively Edited Knowledge Bases ☆45 Updated last year
- ☆23 Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆78 Updated 9 months ago
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 5 months ago
- ☆47 Updated 4 months ago
- ☆14 Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- Verifiers for LLM Reinforcement Learning ☆60 Updated 2 months ago
- ☆16 Updated 11 months ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an… ☆30 Updated 9 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆35 Updated last year
- ☆29 Updated last year
- RuleRAG: Rule-guided Retrieval-Augmented Generation with Language Models for Question Answering ☆22 Updated 7 months ago
- ☆48 Updated 4 months ago
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot ☆46 Updated 2 weeks ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆26 Updated 6 months ago
- Measuring RAG solutions throughput and latency ☆17 Updated 11 months ago
- ☆36 Updated 2 years ago