JoelNiklaus / LegalDatasets
This repository is a collection of scrapers for procuring and structuring various legal datasets.
☆18 Updated 2 years ago
Alternatives and similar repositories for LegalDatasets
Users who are interested in LegalDatasets are comparing it to the repositories listed below.
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- Explore the use of DSPy for extracting features from PDFs ☆45 Updated last year
- Streamlit app for recommending eval functions using prompt diffs ☆29 Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆52 Updated 6 months ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆98 Updated 2 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆46 Updated last year
- A dataset for pretraining language models targeted for legal tasks. ☆139 Updated 3 years ago
- Lightweight Non-Parametric Embedding Fine-Tuning ☆36 Updated 3 weeks ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 Updated 3 years ago
- Official Repo for CRMArena and CRMArena-Pro ☆118 Updated 3 months ago
- Unstructured Data Connectors for Haystack 2.0 ☆17 Updated 2 years ago
- Solve Geometric & Graph Problems with Large Language Models ☆33 Updated 2 years ago
- Universal text classifier for generative models ☆25 Updated last year
- High-level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 Updated 11 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆110 Updated 9 months ago
- ☆20 Updated 2 years ago
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks. ☆123 Updated last week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆86 Updated 3 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated 11 months ago
- Repository of the code base for the KT Generation process that we worked on at the Google Cloud and Searce GenAI Hackathon. ☆76 Updated 2 years ago
- A collection of datasets and other resources for legal text processing. ☆125 Updated 2 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus ☆14 Updated 4 years ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images ☆42 Updated last year
- ☆50 Updated last year
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development ☆20 Updated 2 years ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆119 Updated last week
- Code to extract Knowledge Graph from normal, unstructured text and visualize the resulting graph ☆57 Updated last year