JoelNiklaus / LegalDatasets
This repository is a collection of scrapers for procuring and structuring various legal datasets.
☆18 · Updated 2 years ago
Alternatives and similar repositories for LegalDatasets
Users who are interested in LegalDatasets are comparing it to the repositories listed below.
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 · Updated 2 years ago
- A dataset for pretraining language models targeted for legal tasks. ☆141 · Updated 3 years ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆52 · Updated last year
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆100 · Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops! ☆50 · Updated last year
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI ☆114 · Updated 2 years ago
- Universal text classifier for generative models ☆24 · Updated last year
- Lightweight Non-Parametric Embedding Fine-Tuning ☆40 · Updated 4 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆17 · Updated 2 years ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus ☆13 · Updated 5 years ago
- ☆75 · Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆51 · Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆116 · Updated 6 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆53 · Updated 10 months ago
- Python client library for improving your LLM app accuracy ☆97 · Updated last year
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development ☆20 · Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆69 · Updated 2 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 · Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆74 · Updated last year
- 🚀 A list of Haystack Integrations, maintained by the community or deepset. ☆99 · Updated this week
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held … ☆42 · Updated 2 years ago
- Solve Geometric & Graph Problems with Large Language Models ☆32 · Updated 2 years ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆126 · Updated 3 months ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints" ☆27 · Updated 3 years ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 · Updated last year
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks. ☆134 · Updated 3 weeks ago
- ☆13 · Updated 2 years ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆184 · Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 · Updated 3 years ago
- Code to extract Knowledge Graph from normal, unstructured text and visualize the resulting graph ☆57 · Updated last year