JoelNiklaus / LegalDatasetsLinks
This repository serves as a collection of scrapers procuring and structuring various legal datasets
β17Updated 2 years ago
Alternatives and similar repositories for LegalDatasets
Users that are interested in LegalDatasets are comparing it to the libraries listed below
Sorting:
- Explore the use of DSPy for extracting features from PDFs πβ40Updated last year
- Writing Blog Posts with Generative Feedback Loops!β48Updated last year
- Streamlit app for recommending eval functions using prompt diffsβ27Updated last year
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"β28Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 11 months ago
- π Unstructured Data Connectors for Haystack 2.0β17Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documentsβ13Updated 10 months ago
- β8Updated 11 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β47Updated 9 months ago
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.β13Updated 2 years ago
- β20Updated last year
- β19Updated last year
- Universal text classifier for generative modelsβ24Updated 10 months ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.β26Updated last year
- Create a music review RAG application with Neo4jβ19Updated last year
- Example for Logging LLM Evaluator Prompt Responsesβ15Updated last year
- ChatBot App built using LangChain and Lightning AIβ18Updated 2 years ago
- β23Updated last year
- π©π€π€ A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)β23Updated 2 years ago
- β Pytest-style test runner for langchain projectsβ25Updated 2 years ago
- Unstract's interface to LLMs, Embeddings and VectorDBs.β18Updated 11 months ago
- β46Updated 9 months ago
- β14Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpusβ14Updated 4 years ago
- Table detection with Florence.β14Updated 11 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engineβ31Updated 3 years ago
- Mahabharata text compiled from multiple sources, split into chunks, parsed into CSV files with metadata. Named entities recognised and inβ¦β35Updated last year
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"β41Updated 2 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agentsβ24Updated 3 years ago
- β75Updated last year