import-myself / MembenchLinks
Membenchmark repository
☆23Updated last month
Alternatives and similar repositories for Membench
Users that are interested in Membench are comparing it to the libraries listed below
Sorting:
- ☆31Updated 4 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆137Updated last year
- Awesome papers for role-playing with language models☆205Updated 10 months ago
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆71Updated 4 months ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆129Updated 2 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆85Updated 2 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆59Updated 4 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆56Updated 9 months ago
- Some example codes for drawing figures in research paper☆34Updated 3 years ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆258Updated 2 weeks ago
- SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models☆23Updated 8 months ago
- ☆21Updated last year
- this is an implementation for the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google de…☆39Updated 2 months ago
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".☆25Updated last month
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆62Updated 8 months ago
- NeurIPS 2025: Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs☆63Updated last month
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆81Updated 3 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆235Updated last year
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static t…☆45Updated last week
- A comprehensive collection of process reward models.☆108Updated 2 months ago
- ☆111Updated this week
- ☆162Updated last year
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domainl☆30Updated 8 months ago
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆57Updated 6 months ago
- ☆345Updated 3 months ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆64Updated 6 months ago
- ☆337Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆127Updated 6 months ago
- ☆99Updated 11 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆152Updated last year