The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domainl
☆43Dec 30, 2024Updated last year
Alternatives and similar repositories for LegalAgentBench
Users that are interested in LegalAgentBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- StaRD: Statute Retrieval Dataset based on Real-World Legal Consultation☆20Apr 24, 2025Updated 11 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆38Jun 23, 2025Updated 9 months ago
- LexEval: A Comprehensive Benchmark for Evaluating Large Language Models in Legal Domain☆93Oct 30, 2024Updated last year
- ☆18Jun 3, 2024Updated last year
- CS294/194-196 Large Language Model Agents☆46Dec 20, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- Repository for the paper: "Using deep learning to predict outcomes of legal appeals better than human experts"☆10Aug 1, 2022Updated 3 years ago
- ☆14May 9, 2024Updated last year
- Official code space for "SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development"☆61Oct 24, 2025Updated 5 months ago
- ☆12Jan 7, 2020Updated 6 years ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆131Mar 18, 2025Updated last year
- Code repo for FaStfact: Faster, Stronger Long-Form Factuality Evaluations in LLMs.☆32Nov 5, 2025Updated 5 months ago
- Test-time compute in information retrieval☆54Jul 8, 2025Updated 9 months ago
- A script to draw attention heat map with matplotlib☆14May 7, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- SimKO: Simple Pass@K Policy Optimization☆28Oct 24, 2025Updated 5 months ago
- On the Robustness of GUI Grounding Models Against Image Attacks☆12Apr 8, 2025Updated last year
- code for Modeling Dynamic Pairwise Attention for Crime Classification over Legal Articles, SIGIR 2018☆12Jan 4, 2019Updated 7 years ago
- Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)☆20Updated this week
- ☆13Sep 26, 2024Updated last year
- Grouping and Recognize speaker from an animation video. 从动漫中提取每一个说话人。☆13May 8, 2024Updated last year
- [SIGIR 2025] This is the code repo for our SIGIR'25 paper: Enhancing the Patent Matching Capability of Large Language Models via Memory G…☆19Apr 22, 2025Updated 11 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- ☆14May 20, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for ICCV2025 paper——IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves☆17Jul 11, 2025Updated 9 months ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 2 years ago
- This repository is a collection of legal instruction datasets☆26Jul 12, 2024Updated last year
- Creating an analytics application for military aircraft using ADS-B.☆11Jul 14, 2024Updated last year
- A platform for building reliable AI agents☆93Apr 3, 2026Updated last week
- ☆116Mar 11, 2026Updated last month
- [ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You☆18Mar 18, 2025Updated last year
- 🤖 A multilingual translation tool that automatically converts Hugging Face's daily AI research papers into 🇯🇵 Japanese, 🇰🇷 Korean, �…☆18Updated this week
- ☆20May 12, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆12Jul 21, 2025Updated 8 months ago
- ☆33Jan 10, 2026Updated 3 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- A Benchmark for Multi-Stage Legal Case Documents Generation☆16Feb 24, 2025Updated last year
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated 2 years ago
- 强化学习课程,主要是如何用强化学习解决问题☆15Dec 10, 2024Updated last year
- Data and code for <Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration>, will be updated soon.☆15Mar 21, 2024Updated 2 years ago