JoelNiklaus / LegalDatasets
This repository serves as a collection of scrapers procuring and structuring various legal datasets
☆18 Updated 2 years ago
Alternatives and similar repositories for LegalDatasets
Users who are interested in LegalDatasets are comparing it to the libraries listed below.
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- Explore the use of DSPy for extracting features from PDFs ☆52 Updated last year
- A dataset for pretraining language models targeted for legal tasks. ☆141 Updated 3 years ago
- Unstructured Data Connectors for Haystack 2.0 ☆17 Updated 2 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 Updated 2 years ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 Updated 3 years ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 Updated last year
- Lightweight Non-Parametric Embedding Fine-Tuning ☆40 Updated 4 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆126 Updated 3 months ago
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks. ☆134 Updated 3 weeks ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆84 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆69 Updated 2 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆112 Updated last year
- Source code for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints" ☆27 Updated 3 years ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆116 Updated 6 months ago
- ☆75 Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆51 Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆115 Updated 10 months ago
- A list of Haystack Integrations, maintained by the community or deepset. ☆99 Updated last week
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆44 Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus ☆13 Updated 5 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆100 Updated 2 years ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆90 Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆80 Updated last year
- Python library to use Pleias-RAG models ☆68 Updated 9 months ago
- A personal knowledge base that I can dump information to and help me learn ☆25 Updated 8 months ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆34 Updated last year