Implementations of Curious Replay for model-based adaptation.
☆43Jul 5, 2023Updated 2 years ago
Alternatives and similar repositories for curiousreplay
Users that are interested in curiousreplay are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆39Jul 5, 2023Updated 2 years ago
- [AAAI 2026] WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving☆35Dec 23, 2025Updated 3 months ago
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Dec 4, 2023Updated 2 years ago
- MJCF Importer Extension☆18Jul 24, 2025Updated 8 months ago
- Description☆21Jan 24, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Time travel in JuliaLang. (Useful for testing what code did before your changes)☆17Sep 3, 2023Updated 2 years ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆21Jul 14, 2024Updated last year
- Code for Policy Bifurcation in Safe Reinforcement Learning☆10Jul 4, 2025Updated 9 months ago
- ☆28Jun 6, 2024Updated last year
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 9 months ago
- PyTorch implementation of DDPG with Hindsight Experience Replay (HER)☆10Oct 21, 2019Updated 6 years ago
- A complete Undertale Mod Tool for Android☆18Updated this week
- ☆21Jun 27, 2024Updated last year
- ☆21Jul 9, 2025Updated 9 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability☆40Feb 23, 2026Updated last month
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 6 months ago
- Julia implementation of the Flash Attention algorithm☆19Sep 4, 2023Updated 2 years ago
- ☆14Dec 11, 2018Updated 7 years ago
- Simple stupid C++ interop☆26Apr 25, 2022Updated 3 years ago
- ☆23Apr 2, 2024Updated 2 years ago
- Awk-like tool using python☆11Aug 4, 2020Updated 5 years ago
- An implementation of a neural network training routine using derivative information in Pytorch.☆10Dec 19, 2020Updated 5 years ago
- Second Generation of Large Language Models☆21Jun 30, 2025Updated 9 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Feb 6, 2023Updated 3 years ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 2 years ago
- Bayesian Inverse Graphics for Few-Shot Concept Learning☆12Mar 16, 2025Updated last year
- rddapp: Regression Discontinuity Design Application☆11Sep 2, 2025Updated 7 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated last year
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- A follower to litable☆15Jul 22, 2016Updated 9 years ago
- Implementation of R2-Dreamer.☆69Mar 5, 2026Updated last month
- Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A paper list of sample-efficient reinforcement learning☆18Jan 12, 2022Updated 4 years ago
- Official repository of "Spontaneous symmetry breaking in generative diffusion models"☆43May 22, 2024Updated last year
- CoMAL: Collaboration Collaborative Multi-Agent Large Language Models for Mixed-Autonomy Traffic☆29Jan 14, 2025Updated last year
- C++-accelerated Frenet Trajectory Planning Handler☆48Nov 4, 2025Updated 5 months ago
- An exploration of artificial intelligence, with the help of math, history and Python☆18Nov 30, 2017Updated 8 years ago
- Example code for NeurIPS 2022 paper "Differentiable Analog Quantum Computing for Learning and Control"☆16Apr 6, 2023Updated 3 years ago
- d-EVD-dual-electric-vehicle-dataset☆13Aug 21, 2025Updated 7 months ago