andrewgcodes / onpolicydistillationView external linksLinks
☆27Oct 30, 2025Updated 3 months ago
Alternatives and similar repositories for onpolicydistillation
Users that are interested in onpolicydistillation are comparing it to the libraries listed below
Sorting:
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- CIFAR-10 speedrun: Trains to 94% accuracy in 1.98 seconds on a single NVIDIA A100 GPU.☆56Oct 17, 2025Updated 3 months ago
- A simple mobile/native monorepo template w/ a sync engine.☆15Feb 3, 2026Updated last week
- prinzbench is a private benchmark that ranks LLMs based on their ability to conduct legal research and analysis and locate obscure public…☆32Feb 8, 2026Updated last week
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆95Feb 5, 2026Updated last week
- Simple demo ilustrating the use of LSTM neural network to predict daily changes in the Ethereum cryptocurrency☆10Jan 23, 2018Updated 8 years ago
- This is MPE-pytorch, fix some bugs.☆10Apr 26, 2020Updated 5 years ago
- lol☆10Mar 12, 2021Updated 4 years ago
- GPU accelerated Perlin Noise in python☆11Oct 23, 2020Updated 5 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- Learn one, get them all for free☆12Jan 28, 2024Updated 2 years ago
- ☆22Dec 18, 2025Updated last month
- Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10☆12Apr 6, 2022Updated 3 years ago
- Search, download Vimeo videos and retrieve metadata in Go.☆11Feb 10, 2022Updated 4 years ago
- tuimorphic choose-your-own-adventure story game☆15Jan 19, 2026Updated 3 weeks ago
- An implementation of Self-Calibrating Conformal Prediction, accepted to Neurips 2024. SC-CP combines Venn-Abers calibration and conformal…☆10Jan 28, 2025Updated last year
- FoKL-GP implements Karhunen-Loève decomposed Gaussian processes with built-in forward variable selection. Decomposed GPs are key to embed…☆18Dec 6, 2025Updated 2 months ago
- ☆12Mar 3, 2023Updated 2 years ago
- ☆10Nov 27, 2019Updated 6 years ago
- Creative AI for Visual Art and Music slides and demos.☆11May 2, 2023Updated 2 years ago
- Official collection of Typescript packages for interacting with Autumn☆22Updated this week
- ☆22May 9, 2025Updated 9 months ago
- ☆11Feb 13, 2024Updated 2 years ago
- ☆13Dec 16, 2024Updated last year
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- ☆11Nov 30, 2024Updated last year
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated last year
- ☆23Jun 5, 2025Updated 8 months ago
- Self-contained RTL to GDS flow for simple chip designs☆49Jan 27, 2026Updated 2 weeks ago
- transliterate hindi to english☆15Dec 21, 2025Updated last month
- A mini tutorial on visualizing simulations from the phiflow differentiable fluid solver in Blender.☆15Oct 19, 2021Updated 4 years ago
- ☆10Jul 15, 2024Updated last year
- Repository for the EuroSciPy sprint☆14May 21, 2023Updated 2 years ago
- James' cookbook of evaluations and finetuning experiments☆21Feb 3, 2026Updated last week
- init☆13Dec 4, 2024Updated last year
- ☆15Jun 6, 2022Updated 3 years ago
- A dataset with classified film shots☆11Aug 8, 2022Updated 3 years ago
- 3D Multiblock multiphysics finite volume reacting flow solver. Implemented in Python, Kokkos, and MPI for inter- and intra-node performan…☆10Jan 9, 2025Updated last year