[NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow
☆43Nov 2, 2024Updated last year
Alternatives and similar repositories for meow
Users that are interested in meow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆16Nov 18, 2020Updated 5 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- [ICLR 2022] Denoising Likelihood Score Matching for Conditional Score-based Data Generation☆11Jan 2, 2025Updated last year
- ☆20May 20, 2026Updated last week
- ☆12Sep 7, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆63Apr 4, 2023Updated 3 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆95Jun 4, 2024Updated last year
- A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input☆13Feb 28, 2019Updated 7 years ago
- Deep Reinforcement Learning algorithms for Policy Value methods written from scratch.☆22Aug 27, 2020Updated 5 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 10 months ago
- Official Pytorch Implementation of CMLO in the paper ”When to Update Your Model: Constrained Model-based Reinforcement Learning“☆10Nov 2, 2023Updated 2 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆40Jan 22, 2021Updated 5 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆94Dec 13, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆18Jun 8, 2023Updated 2 years ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Jun 18, 2018Updated 7 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆150Apr 7, 2026Updated last month
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆25May 11, 2024Updated 2 years ago
- ☆63May 19, 2026Updated last week
- A data-driven, fast driving simulator for multi-agent coordination under partial observability.☆37Aug 29, 2024Updated last year
- Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)☆261Apr 27, 2026Updated last month
- official implementation of [CE-Nav: Flow-Guided Reinforcement Refinement for Cross-Embodiment Local Navigation]☆41Mar 17, 2026Updated 2 months ago
- Quantum Principal Component Analysis (QPCA) as a generative model☆13Apr 5, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICML 2021] Learning Task Informed Abstractions -- a representation learning approach for model-based RL in complex visual domains☆18Jul 20, 2021Updated 4 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆23Nov 29, 2025Updated 6 months ago
- [ICLR 2023] Choreographer: a world-model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able t…☆43Jun 18, 2024Updated last year
- ☆20Apr 24, 2026Updated last month
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Mar 4, 2024Updated 2 years ago
- PWM: Policy Learning with Large World Models☆68Aug 4, 2025Updated 9 months ago
- Code for Understanding and Mitigating Exploding Inverses in Invertible Neural Networks (AISTATS 2021) http://arxiv.org/abs/2006.09347☆31Aug 29, 2020Updated 5 years ago
- ☆35Aug 26, 2025Updated 9 months ago
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024☆27Apr 26, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- ☆128Aug 9, 2023Updated 2 years ago
- Qiskit camp 2019 hackathon: Using QAOA for solving the graph coloring problem☆11May 21, 2019Updated 7 years ago
- CoRL 2024 🧠☆33Jun 25, 2025Updated 11 months ago
- Policy Gradient Actor-Critic PyTorch | Lunar Lander v2☆76May 7, 2019Updated 7 years ago
- ☆27Apr 26, 2024Updated 2 years ago