JoelNiklaus / LegalDatasetsLinks
This repository serves as a collection of scrapers procuring and structuring various legal datasets
β17Updated 2 years ago
Alternatives and similar repositories for LegalDatasets
Users that are interested in LegalDatasets are comparing it to the libraries listed below
Sorting:
- Explore the use of DSPy for extracting features from PDFs πβ45Updated last year
- Text to Python Objects via a LLM Function Callβ58Updated last year
- Writing Blog Posts with Generative Feedback Loops!β50Updated last year
- Mixing Language Models with Self-Verification and Meta-Verificationβ105Updated 7 months ago
- A dataset for pretraining language models targeted for legal tasks.β134Updated 3 years ago
- Streamlit app for recommending eval functions using prompt diffsβ29Updated last year
- β75Updated last year
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.β120Updated this week
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legalβ94Updated 2 years ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPIβ115Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.β26Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpusβ14Updated 4 years ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agentsβ24Updated 3 years ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β32Updated 11 months ago
- Example for Logging LLM Evaluator Prompt Responsesβ18Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documentsβ12Updated 11 months ago
- β20Updated last year
- LLM finetuningβ42Updated last year
- π Unstructured Data Connectors for Haystack 2.0β17Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β75Updated 9 months ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held β¦β41Updated 2 years ago
- β14Updated last year
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Developmentβ20Updated 2 years ago
- Universal text classifier for generative modelsβ24Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β66Updated 9 months ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.β74Updated last year
- Verbosity control for AI agentsβ64Updated last year
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"β42Updated 4 months ago
- β Pytest-style test runner for langchain projectsβ25Updated 2 years ago