[NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow
☆43Nov 2, 2024Updated last year
Alternatives and similar repositories for meow
Users that are interested in meow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆16Nov 18, 2020Updated 5 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- ☆12Sep 7, 2024Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆89Jun 4, 2024Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆64Apr 4, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input☆13Feb 28, 2019Updated 7 years ago
- Deep Reinforcement Learning algorithms for Policy Value methods written from scratch.☆22Aug 27, 2020Updated 5 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 8 months ago
- Code base for publication: Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems☆10Feb 1, 2023Updated 3 years ago
- Official Pytorch Implementation of CMLO in the paper ”When to Update Your Model: Constrained Model-based Reinforcement Learning“☆10Nov 2, 2023Updated 2 years ago
- Code for our TMLR paper "Distributional GFlowNets with Quantile Flows".☆13Feb 14, 2024Updated 2 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆95Dec 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆18Jun 8, 2023Updated 2 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆144Jun 23, 2025Updated 9 months ago
- Prioritized Generative Replay (ICLR 2025 Oral)☆28Mar 1, 2025Updated last year
- Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)☆222Aug 5, 2025Updated 7 months ago
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆25May 11, 2024Updated last year
- A data-driven, fast driving simulator for multi-agent coordination under partial observability.☆37Aug 29, 2024Updated last year
- ☆24Mar 10, 2024Updated 2 years ago
- Data-driven discovery of a novel sepsis pre-shock state predicts impending septic shock in the ICU☆17Jan 23, 2021Updated 5 years ago
- ☆123May 30, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Quantum Principal Component Analysis (QPCA) as a generative model☆13Apr 5, 2022Updated 3 years ago
- [ICML 2021] Learning Task Informed Abstractions -- a representation learning approach for model-based RL in complex visual domains☆18Jul 20, 2021Updated 4 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆22Nov 29, 2025Updated 4 months ago
- [ICLR 2023] Choreographer: a world-model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able t…☆42Jun 18, 2024Updated last year
- [TVCG 2024] ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions☆21Feb 28, 2025Updated last year
- PWM: Policy Learning with Large World Models☆65Aug 4, 2025Updated 7 months ago
- ☆35Aug 26, 2025Updated 7 months ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Deep Q-Network (DQN) with Prioritized Experience Replay (PER)☆17Jan 1, 2020Updated 6 years ago
- ☆127Aug 9, 2023Updated 2 years ago
- Code for the paper: Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics☆15Aug 9, 2024Updated last year
- Implementation of Continuous Control RL Algorithms☆11Dec 8, 2022Updated 3 years ago
- Policy Gradient Actor-Critic PyTorch | Lunar Lander v2☆75May 7, 2019Updated 6 years ago
- ☆26Apr 26, 2024Updated last year
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 6 years ago