thoppe / The-Pile-FreeLawLinks
Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.
☆15Updated 2 years ago
Alternatives and similar repositories for The-Pile-FreeLaw
Users that are interested in The-Pile-FreeLaw are comparing it to the libraries listed below
Sorting:
- ☆56Updated 6 months ago
- ☆32Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆86Updated 2 years ago
- ☆160Updated 4 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆65Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Updated last year
- ☆17Updated 9 months ago
- A forest of autonomous agents.☆19Updated 11 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆41Updated 2 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆43Updated last year
- Pre-training code for CrystalCoder 7B LLM☆56Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆88Updated last year
- ☆53Updated last year
- Automated Qualitative Analysis of LLMs (ICLR 2025)☆53Updated 6 months ago
- ☆59Updated last year
- Reward Model framework for LLM RLHF☆62Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆26Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆53Updated 6 months ago
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…☆55Updated 2 years ago
- ☆21Updated 2 years ago
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆110Updated 2 years ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated 2 years ago
- Downloads 2020 English Wikipedia articles as plaintext☆25Updated 2 years ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆155Updated last year
- ☆92Updated 3 years ago
- An attribution library for LLMs☆46Updated last year