JoelNiklaus / LegalDatasets
This repository is a collection of scrapers that procure and structure various legal datasets.
☆18 Updated 2 years ago
Alternatives and similar repositories for LegalDatasets
Users who are interested in LegalDatasets are comparing it to the repositories listed below.
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- A dataset for pretraining language models targeted for legal tasks. ☆139 Updated 3 years ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆45 Updated last year
- ☆14 Updated last year
- ☆75 Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 Updated last year
- Streamlit app for recommending eval functions using prompt diffs ☆29 Updated last year
- A personal knowledge base that I can dump information to and help me learn ☆24 Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated 10 months ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆99 Updated 2 years ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆32 Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆46 Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆33 Updated 3 weeks ago
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks. ☆124 Updated last week
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI ☆114 Updated 2 years ago
- Text to Python Objects via a LLM Function Call ☆58 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆78 Updated 10 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆52 Updated 6 months ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon. ☆76 Updated 2 years ago
- ☆94 Updated last year
- Universal text classifier for generative models ☆24 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆17 Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images ☆40 Updated last year
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints" ☆27 Updated 2 years ago
- ☆20 Updated 2 years ago
- ☆20 Updated last year
- A collection of datasets and other resources for legal text processing. ☆121 Updated 2 weeks ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆85 Updated this week