JoelNiklaus / LegalDatasetsLinks
This repository serves as a collection of scrapers procuring and structuring various legal datasets
β17Updated 2 years ago
Alternatives and similar repositories for LegalDatasets
Users that are interested in LegalDatasets are comparing it to the libraries listed below
Sorting:
- Writing Blog Posts with Generative Feedback Loops!β50Updated last year
- Explore the use of DSPy for extracting features from PDFs πβ47Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.β26Updated 2 years ago
- Streamlit app for recommending eval functions using prompt diffsβ29Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 11 months ago
- A dataset for pretraining language models targeted for legal tasks.β138Updated 3 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legalβ99Updated 2 years ago
- π Unstructured Data Connectors for Haystack 2.0β17Updated 2 years ago
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.β124Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β46Updated last year
- Universal text classifier for generative modelsβ25Updated last year
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPIβ113Updated 2 years ago
- π©π€π€ A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)β24Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β50Updated last year
- Text to Python Objects via a LLM Function Callβ58Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β33Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documentsβ12Updated last year
- β73Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and imagesβ41Updated last year
- Lightweight Non-Parametric Embedding Fine-Tuningβ36Updated last month
- Python client library for improving your LLM app accuracyβ97Updated 8 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)β88Updated last month
- A personal knowledge base that I can dump information to and help me learnβ24Updated 5 months ago
- Mixing Language Models with Self-Verification and Meta-Verificationβ109Updated 10 months ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA onβ¦β47Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β41Updated last year
- π A list of Haystack Integrations, maintained by the community or deepset.β98Updated this week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β78Updated 11 months ago
- Chunk your text using gpt4o-mini more accuratelyβ44Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β34Updated 2 months ago