tristandeleu / gfn-maxent-rl
Comparison between GFlowNets & Maximum Entropy RL
☆16 Updated last year
Alternatives and similar repositories for gfn-maxent-rl:
Users who are interested in gfn-maxent-rl are comparing it to the libraries listed below.
- Repository for "Generative Flow Networks as Entropy-Regularized RL" (AISTATS-2024, Oral) ☆30 Updated 10 months ago
- A PyTorch implementation of a Generative Flow Network (GFlowNet) proposed by Bengio et al. (2021) ☆42 Updated last year
- Code for paper "Compositional Sculpting of Iterative Generative Processes" ☆20 Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆95 Updated last year
- ☆23 Updated 4 months ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling" ☆81 Updated last year
- ☆24 Updated last year
- Code for "Bayesian Structure Learning with Generative Flow Networks" ☆83 Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function ☆13 Updated 2 years ago
- GFlowNets, MCMC, Metropolis-Hastings, Gibbs sampling, Metropolis-adjusted Langevin, Inverse Transform Sampling, Acceptance-Rejection Metho… ☆86 Updated 2 years ago
- iQRL: implicitly Quantized Representations for Sample-efficient Reinforcement Learning ☆9 Updated last month
- Code release for "Stochastic Optimal Control Matching" ☆30 Updated 6 months ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆55 Updated last year
- ☆73 Updated 3 months ago
- [ICML 2024] Official implementation for "Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling". ☆31 Updated 2 months ago
- Accelerated replay buffers in JAX ☆41 Updated 2 years ago
- ☆56 Updated 2 years ago
- A collection of meta-learning algorithms in Jax ☆23 Updated 2 years ago
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference" ☆39 Updated 7 months ago
- Artificial Kuramoto Oscillatory Neurons ☆52 Updated this week
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆17 Updated 3 months ago
- ☆29 Updated 4 years ago
- Learning diverse options through the Laplacian representation. ☆23 Updated last year
- An Open-Ended Agentic Simulator ☆39 Updated 6 months ago
- Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (M… ☆25 Updated 3 months ago
- ☆48 Updated 3 years ago
- Simple tools to mix and match PyTorch and Jax - Get the best of both worlds! ☆21 Updated 2 months ago
- ☆71 Updated 6 months ago
- POPGym Library in JAX ☆11 Updated 10 months ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery ☆13 Updated last year