Implementations of Curious Replay for model-based adaptation.
☆43Jul 5, 2023Updated 2 years ago
Alternatives and similar repositories for curiousreplay
Users that are interested in curiousreplay are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆40Jul 5, 2023Updated 2 years ago
- [AAAI 2026] WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving☆38Dec 23, 2025Updated 5 months ago
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Dec 4, 2023Updated 2 years ago
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆54Jun 27, 2024Updated last year
- MJCF Importer Extension☆18Jul 24, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆25May 11, 2024Updated 2 years ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆22Jul 14, 2024Updated last year
- Code for Policy Bifurcation in Safe Reinforcement Learning☆10Jul 4, 2025Updated 11 months ago
- ☆27Aug 16, 2025Updated 9 months ago
- ☆29Jun 6, 2024Updated 2 years ago
- PyTorch implementation of DDPG with Hindsight Experience Replay (HER)☆10Oct 21, 2019Updated 6 years ago
- Official modding tool for PokeWilds.☆10Jun 30, 2022Updated 3 years ago
- Vector Bazel Rules and Toolchains☆16Mar 2, 2026Updated 3 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆28Oct 14, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Julia implementation of the Flash Attention algorithm☆19Sep 4, 2023Updated 2 years ago
- Simple stupid C++ interop☆26Apr 25, 2022Updated 4 years ago
- Second Generation of Large Language Models☆21Jun 30, 2025Updated 11 months ago
- An implementation of a neural network training routine using derivative information in Pytorch.☆11Dec 19, 2020Updated 5 years ago
- ☆23Apr 2, 2024Updated 2 years ago
- ☆11Nov 27, 2025Updated 6 months ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆33Feb 6, 2023Updated 3 years ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 2 years ago
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm☆27May 18, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated 2 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- ☆90Aug 21, 2023Updated 2 years ago
- Partial set of hardware designs for a Meta-developed brain computer interface (BCI) research prototype system (Spotlight).☆16Mar 23, 2023Updated 3 years ago
- Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated 2 years ago
- A paper list of sample-efficient reinforcement learning☆19Jan 12, 2022Updated 4 years ago
- An exploration of artificial intelligence, with the help of math, history and Python☆18Nov 30, 2017Updated 8 years ago
- Example code for NeurIPS 2022 paper "Differentiable Analog Quantum Computing for Learning and Control"☆16Apr 6, 2023Updated 3 years ago
- C++-accelerated Frenet Trajectory Planning Handler☆47Nov 4, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Multi Agent Traffic Scenario Gym: A scenario-based training and evaluation framework for CARLA.☆50Jul 3, 2024Updated last year
- Implementation of R2-Dreamer.☆111May 31, 2026Updated 2 weeks ago
- Inverse Kinematics of a 7dof manipulator☆15Jun 3, 2024Updated 2 years ago
- Thinker project☆16Sep 4, 2024Updated last year
- [ICML2025] Official codebase for "TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching"☆20Jul 14, 2025Updated 11 months ago
- A RL benchmark framework based on real world problem☆13Jun 28, 2023Updated 2 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆20Feb 9, 2024Updated 2 years ago