Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.
☆15Jun 3, 2023Updated 2 years ago
Alternatives and similar repositories for The-Pile-FreeLaw
Users that are interested in The-Pile-FreeLaw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Downloads 2020 English Wikipedia articles as plaintext☆27Mar 25, 2023Updated 3 years ago
- Script for downloading GitHub.☆13Sep 24, 2020Updated 5 years ago
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk image…☆25Feb 19, 2026Updated last month
- URL downloader supporting checkpointing and continuous checksumming.☆19Nov 29, 2023Updated 2 years ago
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆25Feb 16, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models☆13Nov 2, 2023Updated 2 years ago
- downloads and parses subtitle dataset from opensubtitles.org☆15Apr 19, 2024Updated last year
- ☆10Feb 6, 2025Updated last year
- ☆27Oct 31, 2025Updated 4 months ago
- Simple migration engine for Peewee☆19Updated this week
- ☆95Jul 16, 2022Updated 3 years ago
- Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…☆20Jun 13, 2025Updated 9 months ago
- Original implementation of SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback (ICLR 2025)☆17Feb 17, 2025Updated last year
- Memory Agent monorepo☆84Oct 9, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆11Feb 23, 2024Updated 2 years ago
- Examples related to Amazon Lightsail☆12Jul 17, 2024Updated last year
- Harness CI migration utility☆11Mar 19, 2026Updated last week
- ☆14Feb 11, 2022Updated 4 years ago
- ☆21Jul 25, 2025Updated 8 months ago
- Chakra UI Animations is a dependancy which offers you pre-built animations for your Chakra UI components.☆14Oct 18, 2022Updated 3 years ago
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training☆46Sep 22, 2020Updated 5 years ago
- ☆23Feb 11, 2026Updated last month
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆11Nov 28, 2015Updated 10 years ago
- The best terminal chat client for your live streams☆19Jun 10, 2023Updated 2 years ago
- ALAS: Autonomous Learning Agent System☆15Aug 14, 2025Updated 7 months ago
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆43Jan 6, 2021Updated 5 years ago
- ☆22Feb 9, 2023Updated 3 years ago
- Dataset of Canada goose images with annotations of bounding boxes with object classes, suitable for testing object detection algorithms.☆40Aug 2, 2018Updated 7 years ago
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆33Oct 16, 2023Updated 2 years ago
- FastAPI Microservices Architecture SDK - As Basis for multiple services in a platform/system☆12Oct 4, 2022Updated 3 years ago
- MediaWiki Categories Model☆13Feb 14, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Mar 22, 2026Updated last week
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 7 months ago
- BDD testing of pippo demo with Cucumber and Serenity☆12Nov 10, 2015Updated 10 years ago
- ☆14Feb 19, 2016Updated 10 years ago
- Example project to demonstrate serving Next.js app with NGINX and Docker☆16Nov 22, 2020Updated 5 years ago
- one-click deepfake (face swap)☆10May 30, 2023Updated 2 years ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Jan 16, 2025Updated last year