CSHaitao / LexEvalLinks

LexEval: A Comprehensive Benchmark for Evaluating Large Language Models in Legal Domain

☆72

Alternatives and similar repositories for LexEval

Users that are interested in LexEval are comparing it to the libraries listed below

Sorting:

OpenMatch / ActiveRAG
This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".
☆107Updated 9 months ago
zorazrw / filco
[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton
☆193Updated last year
RUC-NLPIR / OmniEval
Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"
☆69Updated 7 months ago
dongguanting / FollowRAG
The demo, code and data of FollowRAG
☆74Updated last month
TableBench / TableBench
Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"
☆67Updated 2 months ago
ConiferLM / Conifer
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
☆88Updated last year
zjunlp / OneGen
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
☆148Updated 8 months ago
CSHaitao / LegalAgentBench
The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domainl
☆28Updated 7 months ago
IAAR-Shanghai / PGRAG
PGRAG
☆52Updated last year
RUC-NLPIR / CORAL
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
☆58Updated 2 months ago
MozerWang / Loong
[EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
☆139Updated 8 months ago
zjunlp / FactCHD
[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
☆88Updated last year
dongguanting / DPA-RAG
The code and data of DPA-RAG, accepted by WWW 2025 main conference.
☆61Updated 6 months ago
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆255Updated 3 weeks ago
CraftJarvis / RAT
Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".
☆241Updated last year
yale-nlp / MCTS-RAG
☆58Updated last month
plageon / SlimPlm
Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)
☆67Updated 2 months ago
OpenMatch / RAG-DDR
[ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…
☆41Updated 5 months ago
RAG-Gym / RAG-Gym
Official repository for RAG-Gym
☆111Updated 4 months ago
oneal2000 / PRAG
Code for Parametric RAG, SIGIR 2025 Full Paper
☆182Updated 3 months ago
Hannibal046 / xRAG
[Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
☆148Updated last year
SqueezeAILab / LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
☆186Updated last year
xlang-ai / BRIGHT
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆152Updated 2 months ago
xsc1234 / Search-in-the-Chain
Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks
☆57Updated last year
thu-coai / CritiqueLLM
☆144Updated last year
carriex / recomp
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.
☆135Updated 2 months ago
RUCAIBox / SimpleDeepSearcher
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
☆91Updated 2 months ago
zjunlp / AutoAct
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
☆229Updated 6 months ago
xlang-ai / Spider2-V
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
☆129Updated 11 months ago
ADaM-BJTU / AutoCoA
AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…
☆122Updated 4 months ago