thoppe / The-Pile-FreeLaw
Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.
☆11 Updated 2 years ago
Alternatives and similar repositories for The-Pile-FreeLaw
Users who are interested in The-Pile-FreeLaw are comparing it to the repositories listed below.
- ☆15 Updated 2 months ago
- Lightweight tools for quick and easy LLM demos ☆28 Updated 9 months ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models ☆18 Updated 2 months ago
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations… ☆29 Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 7 months ago
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 5 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆35 Updated last year
- Pre-training code for CrystalCoder 7B LLM ☆54 Updated last year
- ☆18 Updated 9 months ago
- Downloads 2020 English Wikipedia articles as plaintext ☆25 Updated 2 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models ☆81 Updated last year
- LLMs as Collaboratively Edited Knowledge Bases ☆45 Updated last year
- Reasoning by Communicating with Agents ☆29 Updated last month
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models ☆26 Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models. ☆30 Updated 5 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆65 Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4" ☆43 Updated 8 months ago
- ☆47 Updated 4 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆35 Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆43 Updated last year
- ☆20 Updated last year
- ☆51 Updated 7 months ago
- Small, simple agent task environments for training and evaluation ☆18 Updated 7 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model ☆18 Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆43 Updated 2 years ago
- ☆57 Updated 9 months ago
- ☆30 Updated 11 months ago