JoelNiklaus / LegalDatasetsLinks

This repository serves as a collection of scrapers procuring and structuring various legal datasets

☆17

Alternatives and similar repositories for LegalDatasets

Users that are interested in LegalDatasets are comparing it to the libraries listed below

Sorting:

S1M0N38 / dspy-arxiv
Explore the use of DSPy for extracting features from PDFs 🔎
☆45Updated last year
NirantK / agentai
Text to Python Objects via a LLM Function Call
☆58Updated last year
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆50Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
Breakend / PileOfLaw
A dataset for pretraining language models targeted for legal tasks.
☆134Updated 3 years ago
langchain-ai / prompt-eval-recommendation
Streamlit app for recommending eval functions using prompt diffs
☆29Updated last year
BerriAI / bettertest
☆75Updated last year
fw-ai / cookbook
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
☆120Updated this week
Liquid-Legal-Institute / Legal-LLMs-GPTs
Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal
☆94Updated 2 years ago
nomic-ai / semantic-search-app-template
Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI
☆115Updated last year
louisbrulenaudet / docutron
Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.
☆26Updated last year
JSv4 / AtticusClassifier
Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus
☆14Updated 4 years ago
allenai / CommaQA
Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents
☆24Updated 3 years ago
PrithivirajDamodaran / SPLADERunner
Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…
☆32Updated 11 months ago
dair-ai / llm-evaluator
Example for Logging LLM Evaluator Prompt Responses
☆18Updated last year
JustlyAI / lmss_entity_extractor
Tool to apply Legal Matter Specification Standard (LMSS) to documents
☆12Updated 11 months ago
iulia-b10 / multilingual-embedding-models
☆20Updated last year
jina-ai / jerboa
LLM finetuning
☆42Updated last year
TuanaCelik / unstructuredio-haystack
💙 Unstructured Data Connectors for Haystack 2.0
☆17Updated last year
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated 9 months ago
Law-AI / ecir2023tutorial
This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …
☆41Updated 2 years ago
iulia-b10 / query_transformations
☆14Updated last year
coastalcph / lexlms
LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
☆20Updated 2 years ago
Knowledgator / unlimited_classifier
Universal text classifier for generative models
☆24Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
ravi03071991 / KT_Generator
Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.
☆74Updated last year
BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
microsoft / Structured-Entity-Extraction
Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"
☆42Updated 4 months ago
ajndkr / pytest-langchain
✅ Pytest-style test runner for langchain projects
☆25Updated 2 years ago