hoorangyee / LRAGELinks
A framework for evaluating RAG pipelines, specifically adapted for the legal domain.
☆68Updated 2 months ago
Alternatives and similar repositories for LRAGE
Users that are interested in LRAGE are comparing it to the libraries listed below
Sorting:
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)☆172Updated 3 months ago
- RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025]☆118Updated 8 months ago
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆83Updated 6 months ago
- ☆156Updated 5 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆189Updated this week
- An Automatic Prompt Optimization Framework for Large Language Models☆123Updated 2 months ago
- Official code repository for Sketch-of-Thought (SoT)☆128Updated 5 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆343Updated 3 months ago
- Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)☆228Updated 2 weeks ago
- [NAACL2025] LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications☆124Updated 3 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆119Updated 8 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆44Updated 2 months ago
- ☆82Updated 11 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆156Updated 2 months ago
- ☆100Updated last year
- AWM: Agent Workflow Memory☆328Updated 8 months ago
- ☆95Updated 2 weeks ago
- ☆119Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 6 months ago
- ☆78Updated last week
- ☆218Updated 7 months ago
- ☆146Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆441Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆159Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 5 months ago
- Official Repo for CRMArena and CRMArena-Pro☆118Updated 3 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆105Updated 9 months ago
- MCP-based Agent Deep Evaluation System☆135Updated 2 weeks ago
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆35Updated 2 months ago
- Complex Function Calling Benchmark.☆135Updated 8 months ago