harveyai / biglaw-bench

☆75

Alternatives and similar repositories for biglaw-bench

Users who are interested in biglaw-bench are comparing it to the repositories listed below.

zeroentropy-ai / legalbenchrag
This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.
☆97 Updated 3 weeks ago
patronus-ai / financebench
☆181 Updated 6 months ago
predlico / ARAGOG
ARAGOG - Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…
☆107 Updated last year
Knowledgator / GLiClass
Generalist and Lightweight Model for Text Classification
☆134 Updated 2 weeks ago
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆64 Updated last year
MadryLab / context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
☆242 Updated 8 months ago
chaitanyamalaviya / ExpertQA
[Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers
☆130 Updated last year
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆100 Updated last year
rungalileo / hallucination-index
Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
☆111 Updated 9 months ago
coastalcph / lex-glue
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English
☆208 Updated 2 years ago
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆78 Updated 9 months ago
openlegaldata / awesome-legal-data
Collection of Datasets for Legal Text Processing
☆106 Updated 2 years ago
reglab / casehold
Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…
☆90 Updated 2 years ago
Breakend / PileOfLaw
A dataset for pretraining language models targeted for legal tasks.
☆133 Updated 2 years ago
zetaalphavector / RAGElo
RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker
☆112 Updated 2 weeks ago
apple / ml-superposition-prompting
☆144 Updated 11 months ago
SalesforceAIResearch / CRMArena
Official Repo for CRMArena and CRMArena-Pro
☆97 Updated this week
jxnl / n-levels-of-rag
☆195 Updated last year
HazyResearch / legalbench
An open science effort to benchmark legal reasoning in foundation models
☆443 Updated 10 months ago
273v / kelvin-public-examples
Kelvin Legal Data OS - Public Examples
☆19 Updated last year
illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆213 Updated 3 weeks ago
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆283 Updated 7 months ago
tonywu71 / colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆314 Updated 3 weeks ago
quotient-ai / judges
A small library of LLM judges
☆216 Updated last week
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆173 Updated 9 months ago
ritun16 / chain-of-verification
This repository implements the chain of verification paper by Meta AI
☆170 Updated last year
MoritzLaurer / synthetic-data-blog
This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data
☆68 Updated last year
isaacus-dev / semchunk
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
☆328 Updated 2 weeks ago
chentong0 / factoid-wiki
Dense X Retrieval: What Retrieval Granularity Should We Use?
☆157 Updated last year
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆99 Updated last year