thoppe / The-Pile-FreeLaw
Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.
☆11Updated last year
Alternatives and similar repositories for The-Pile-FreeLaw
Users that are interested in The-Pile-FreeLaw are comparing it to the libraries listed below
Sorting:
- Lightweight tools for quick and easy LLM demo's☆26Updated 7 months ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆82Updated last year
- ☆20Updated last year
- Efficiently computing & storing token n-grams from large corpora☆23Updated 7 months ago
- ☆11Updated last year
- LLM finetuning☆42Updated last year
- ☆15Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Downloads 2020 English Wikipedia articles as plaintext☆25Updated 2 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆34Updated last year
- ☆43Updated 3 months ago
- ☆90Updated 2 years ago
- Simple Model Similarities Analysis☆21Updated last year
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Updated 2 years ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆17Updated last month
- ☆15Updated 4 months ago
- ☆57Updated 7 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆23Updated 2 months ago
- Script for downloading GitHub.☆93Updated 10 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- ☆30Updated 10 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated 2 years ago
- Run SWE-bench evaluations remotely☆13Updated this week
- ☆48Updated 6 months ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆16Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆43Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- Small, simple agent task environments for training and evaluation☆18Updated 6 months ago
- Pre-training code for CrystalCoder 7B LLM☆54Updated last year