harveyai / biglaw-bench
☆63Updated 4 months ago
Alternatives and similar repositories for biglaw-bench:
Users that are interested in biglaw-bench are comparing it to the libraries listed below
- An open science effort to benchmark legal reasoning in foundation models☆415Updated 7 months ago
- A dataset for pretraining language models targeted for legal tasks.☆127Updated 2 years ago
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.☆79Updated 2 months ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆201Updated last year
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal☆86Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆102Updated 11 months ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆107Updated 2 weeks ago
- Collection of Datasets for Legal Text Processing☆95Updated last year
- SALI LMSS: Legal Matter Standard Specification☆60Updated last week
- ☆156Updated 4 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆68Updated 8 months ago
- A spaCy wrapper for GliNER☆109Updated 2 months ago
- A collection of datasets and tasks for legal machine learning☆364Updated 9 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆56Updated 8 months ago
- Generalist and Lightweight Model for Text Classification☆110Updated this week
- Kelvin Legal Data OS - Public Examples☆18Updated last year
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆87Updated 2 years ago
- ☆42Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆107Updated 6 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆160Updated 6 months ago
- A simple library for segmenting legal texts☆15Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆25Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆416Updated last year
- AI + Legal APIs: A Tool-Based Retrieval Augmented Generation Workbench for Legal AI UX Research.☆67Updated 5 months ago
- Preprocessing pipeline notebooks and API supporting text extraction from SEC documents☆143Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- Mining Legal Arguments in Court Decisions - Data and software☆66Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated 10 months ago
- ☆194Updated 10 months ago