rungalileo / ragbench
☆19Updated last year
Alternatives and similar repositories for ragbench
Users who are interested in ragbench are comparing it to the libraries listed below.
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding☆87Updated last year
- [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa…☆55Updated 4 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆28Updated last year
- Large language models for document ranking.☆71Updated last month
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs☆34Updated last year
- ☆51Updated last year
- Code/data for MARG (multi-agent review generation)☆59Updated 2 months ago
- A Comprehensive Library for Memory of LLM-based Agents.☆94Updated 7 months ago
- Code and Data for "Language Modeling with Editable External Knowledge"☆36Updated last year
- Code for Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks (WWW 2024)☆58Updated last month
- PGRAG☆51Updated last year
- Comprehensive benchmark for RAG☆249Updated 6 months ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆48Updated last month
- ☆41Updated 5 months ago
- [ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆36Updated 6 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- ☆38Updated last year
- ☆242Updated last year
- (ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆28Updated 5 months ago
- ☆55Updated 10 months ago
- SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by an LLM to guide the agent's …☆18Updated 3 months ago
- ☆82Updated last year
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆33Updated 5 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆166Updated last year
- Benchmark baseline for retrieval QA applications☆118Updated last year
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆64Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆56Updated last year
- Test-time compute in information retrieval☆47Updated 5 months ago
- ☆43Updated 2 years ago
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆92Updated last year