Implementations of Curious Replay for model-based adaptation.
☆43Jul 5, 2023Updated 2 years ago
Alternatives and similar repositories for curiousreplay
Users that are interested in curiousreplay are comparing it to the libraries listed below.
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆40Jul 5, 2023Updated 2 years ago
- [ICLR 2026] From Observations to Events: Event-Aware World Models for Reinforcement Learning☆48May 30, 2026Updated last month
- [AAAI 2026] WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving☆39Dec 23, 2025Updated 6 months ago
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Dec 4, 2023Updated 2 years ago
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆55Jun 27, 2024Updated 2 years ago
- The official implementation of InfoRM [NeurIPS 2024].☆16Oct 25, 2025Updated 8 months ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 6 years ago
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…☆14Aug 8, 2025Updated 10 months ago
- Implementation of the HIERarchical imagination On Structured State Space Sequence Models (HIEROS) paper☆22Jul 14, 2024Updated last year
- Code for Policy Bifurcation in Safe Reinforcement Learning☆10Jul 4, 2025Updated last year
- ☆30Jun 6, 2024Updated 2 years ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 11 months ago
- PyTorch implementation of DDPG with Hindsight Experience Replay (HER)☆10Oct 21, 2019Updated 6 years ago
- ☆21Jun 27, 2024Updated 2 years ago
- ☆21Jul 9, 2025Updated 11 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆29Oct 14, 2025Updated 8 months ago
- Julia implementation of the Flash Attention algorithm☆18Sep 4, 2023Updated 2 years ago
- Imitation Learning via Differentiable Physics☆44Aug 6, 2022Updated 3 years ago
- [CoRL 25] Official implementation of SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL☆19Sep 13, 2025Updated 9 months ago
- (T-IV) Dream to Drive with Predictive Individual World Model☆47Aug 8, 2025Updated 10 months ago
- ☆14Dec 11, 2018Updated 7 years ago
- Simple stupid C++ interop☆26Apr 25, 2022Updated 4 years ago
- Awk-like tool using python☆10Aug 4, 2020Updated 5 years ago
- An implementation of a neural network training routine using derivative information in Pytorch.☆11Dec 19, 2020Updated 5 years ago
- ☆11Nov 27, 2025Updated 7 months ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 3 years ago
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm☆27May 18, 2025Updated last year
- Bayesian Inverse Graphics for Few-Shot Concept Learning☆12Mar 16, 2025Updated last year
- A Python CLI game and library for Tic-tac-toe.☆10Apr 4, 2017Updated 9 years ago
- Official code and data repository of MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated 2 years ago
- Example code for NeurIPS 2022 paper "Differentiable Analog Quantum Computing for Learning and Control"☆16Apr 6, 2023Updated 3 years ago
- CoMAL: Collaborative Multi-Agent Large Language Models for Mixed-Autonomy Traffic☆28Jan 14, 2025Updated last year
- d-EVD-dual-electric-vehicle-dataset☆13Apr 24, 2026Updated 2 months ago
- Multi Agent Traffic Scenario Gym: A scenario-based training and evaluation framework for CARLA.☆51Jul 3, 2024Updated 2 years ago
- Cayley Dickson algebra implementation in python☆13Jan 3, 2019Updated 7 years ago
- Thinker project☆16Sep 4, 2024Updated last year
- Addition to carla_ros_bridge to convert carla messages to autoware messages☆38Apr 29, 2024Updated 2 years ago
- [ICML2025] Official codebase for "TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching"☆21Jul 14, 2025Updated 11 months ago
- These will be public notes for courses that I'm self-studying.☆27Jun 22, 2020Updated 6 years ago