thoppe / The-Pile-FreeLawLinks
Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.
☆15Updated 2 years ago
Alternatives and similar repositories for The-Pile-FreeLaw
Users that are interested in The-Pile-FreeLaw are comparing it to the libraries listed below
Sorting:
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆85Updated 2 years ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆41Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆43Updated last year
- Aider's refactoring benchmark exercises based on popular python repos☆78Updated last year
- ☆58Updated last year
- ☆31Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated last year
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing☆52Updated last year
- Small, simple agent task environments for training and evaluation☆19Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆45Updated 2 years ago
- ☆17Updated 8 months ago
- ☆56Updated 5 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆54Updated 5 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Updated 2 years ago
- ☆160Updated 4 years ago
- Code for constructing TLDR corpus from Reddit dataset☆27Updated 4 years ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆155Updated last year
- ☆21Updated 2 years ago
- ☆53Updated 10 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆170Updated last year
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…☆54Updated 2 years ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 10 months ago
- An attribution library for LLMs☆46Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆88Updated last year
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'☆19Updated 2 years ago
- Ongoing research training transformer models at scale☆42Updated this week