This repository serves as a collection of scrapers procuring and structuring various legal datasets
☆19Jun 16, 2023Updated 2 years ago
Alternatives and similar repositories for LegalDatasets
Users that are interested in LegalDatasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆22Jul 24, 2023Updated 2 years ago
- Python API for Docket Alarm☆32Sep 8, 2020Updated 5 years ago
- Python libraries for extracting from data sources like Rechtspraak, ECHR, Cellar☆13Jul 2, 2025Updated 10 months ago
- An interactive course on prompt engineering for lawyers built using streamlit.☆18Sep 1, 2024Updated last year
- The code used to evaluate embedding models on the Massive Legal Embedding Benchmark (MLEB).☆38Feb 24, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LAWLIA is an open-source computational legal framework designed to revolutionize legal reasoning and analysis. It combines the power of l…☆23Dec 6, 2023Updated 2 years ago
- Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"☆23Mar 30, 2024Updated 2 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- Open Legal Data Platform☆144Updated this week
- GPT-3.5-trubo + Harvard's Case Access Project☆18Jun 6, 2023Updated 2 years ago
- A spaced repetition learning platform to create, memorize and share your knowledge list using flashcards.☆14Oct 10, 2022Updated 3 years ago
- A dataset for pretraining language models targeted for legal tasks.☆145Jun 30, 2022Updated 3 years ago
- Nano Bots for Obsidian: small, AI-powered bots that can be easily shared as a single file, designed to support multiple providers such as…☆15Jan 13, 2024Updated 2 years ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆95May 15, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- AllThePatents tooling☆11Mar 23, 2024Updated 2 years ago
- KL3M training data collection and preprocessing☆22Apr 14, 2025Updated last year
- This repository is a collection of legal instruction datasets☆27Jul 12, 2024Updated last year
- Blueprint for federated finetuning, enabling multiple data owners to collaboratively fine-tune models without sharing raw data. Developed…☆41Apr 8, 2026Updated last month
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- GenAI Experimentation☆59Mar 12, 2026Updated 2 months ago
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)☆20Sep 21, 2023Updated 2 years ago
- Project automates AI news gathering and Blog post Writing. Our AI agent collects insights and news about any topic From Internet and Wri…☆36Apr 16, 2024Updated 2 years ago
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting☆18Nov 28, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This very simple python script takes inputs from your business and outputs articles written bhy claude.☆13Apr 3, 2024Updated 2 years ago
- Python Script for Copywriters to Gather Data from Competing Content and Find Keyword Overlap☆15Apr 23, 2022Updated 4 years ago
- A financial disclosure data extraction tool.☆21Aug 2, 2023Updated 2 years ago
- ☆19Jan 4, 2026Updated 4 months ago
- Python script designed to simplify the process of submitting URLs to Google's Indexing API for faster and more efficient website indexing…☆12Sep 12, 2023Updated 2 years ago
- Explains Canadian Bills☆17May 13, 2023Updated 3 years ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP☆23Dec 28, 2023Updated 2 years ago
- Welcome to GPTNicheFinder, a powerful niche research tool powered by Google Trends data and OpenAI's GPT model. This application provides…☆86Feb 9, 2024Updated 2 years ago
- Legalpioneer dataset☆15Apr 10, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Original schema.org python-appengine codebase☆19Apr 10, 2022Updated 4 years ago
- A python package to create queries to the EU cellar repository and download data.☆13Mar 9, 2026Updated 2 months ago
- The homepage for ConvSearch Dataset.☆14May 31, 2022Updated 3 years ago
- Scripts created by AI Agents to use as tools for data collection. These are examples of Inforensics JITT (Just-In-Time Tool) Agent.☆15Jul 7, 2024Updated last year
- Medical natural language parsing and utility library☆14Dec 10, 2025Updated 5 months ago
- The backend behind the LLM-Perf Leaderboard☆11May 5, 2024Updated 2 years ago
- Python library to work with proxy server items loaded from local file or network document.☆18Dec 21, 2022Updated 3 years ago