thoppe / The-Pile-FreeLawLinks

Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.

☆11

Alternatives and similar repositories for The-Pile-FreeLaw

Users that are interested in The-Pile-FreeLaw are comparing it to the libraries listed below

Sorting:

akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
EleutherAI / stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
☆83Updated last year
EleutherAI / lm_perplexity
☆151Updated 4 years ago
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆105Updated 9 months ago
QuixiAI / SystemChat
☆30Updated last year
jina-ai / jerboa
LLM finetuning
☆42Updated last year
DerwenAI / textgraphs
TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph
☆24Updated last year
neoxelox / dspy-inspector
DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.
☆36Updated last year
fblgit / tree-of-knowledge
ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…
☆53Updated 2 years ago
CarperAI / Code-Pile
This repository contains all the code for collecting large scale amounts of code from GitHub.
☆110Updated 2 years ago
allenai / adapt-demos
Lightweight tools for quick and easy LLM demo's
☆28Updated 9 months ago
deepsearch-ai / deepsearch
A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images
☆40Updated last year
sdtblck / youtube_subtitle_dataset
YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training
☆43Updated 4 years ago
agential-ai / agential
🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
☆52Updated last week
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆54Updated last year
the-crypt-keeper / tiny_starcoder
Python examples using the bigcode/tiny_starcoder_py 159M model to generate code
☆44Updated 2 years ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆106Updated 7 months ago
argilla-io / awesome-llm-datasets
👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
☆23Updated 2 years ago
leap-laboratories / PIZZA
An attribution library for LLMs
☆42Updated 10 months ago
Agora-Lab-AI / Orca
An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"
☆43Updated 9 months ago
noanabeshima / wikipedia-downloader
Downloads 2020 English Wikipedia articles as plaintext
☆25Updated 2 years ago
EleutherAI / openwebtext2
☆90Updated 3 years ago
OSU-NLP-Group / SeeActChromeExtension
☆16Updated 6 months ago
stanford-oval / chainlite
LangChain + LiteLLM that works
☆44Updated last month
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆49Updated last year
oughtinc / primer
Factored Cognition Primer: How to write compositional language model programs
☆49Updated 2 years ago
oriyor / assistantbench
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆58Updated 7 months ago
kddubey / cappr
Completion After Prompt Probability. Make your LLM make a choice
☆79Updated 8 months ago
simonw / llm-cluster
LLM plugin for clustering embeddings
☆77Updated last year
The-Swarm-Corporation / swarm-models
A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and perf…
☆13Updated 3 months ago