harveyai / biglaw-benchLinks
☆80Updated 8 months ago
Alternatives and similar repositories for biglaw-bench
Users that are interested in biglaw-bench are comparing it to the libraries listed below
Sorting:
- An open science effort to benchmark legal reasoning in foundation models☆457Updated 10 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆432Updated last year
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.☆108Updated last month
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆173Updated 9 months ago
- ☆195Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆131Updated 6 months ago
- Collection of Datasets for Legal Text Processing☆110Updated 2 years ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆107Updated last year
- awesome synthetic (text) datasets☆289Updated last week
- Pinecone text client library☆64Updated 4 months ago
- A collection of datasets and tasks for legal machine learning☆388Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆250Updated 9 months ago
- A dataset for pretraining language models targeted for legal tasks.☆134Updated 3 years ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆271Updated last month
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆113Updated last week
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆99Updated last year
- ☆184Updated 7 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆111Updated 10 months ago
- A small library of LLM judges☆232Updated 3 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆101Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 9 months ago
- Generalist and Lightweight Model for Text Classification☆139Updated last month
- ☆151Updated this week
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆343Updated last month
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆105Updated 3 weeks ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆324Updated 8 months ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆212Updated 2 years ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆123Updated last week