tristandeleu / gfn-maxent-rl
Comparison between GFlowNets & Maximum Entropy RL
☆15Updated 10 months ago
Alternatives and similar repositories for gfn-maxent-rl:
Users that are interested in gfn-maxent-rl are comparing it to the libraries listed below
- Artificial Kuramoto Oscillatory Neurons☆45Updated this week
- A PyTorch implementation of a Generative Flow Network (GFlowNet) proposed by Bengio et al. (2021)☆41Updated last year
- Repository for "Generative Flow Networks as Entropy-Regularized RL" (AISTATS-2024, Oral)☆30Updated 8 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆94Updated last year
- Code for paper "Compositional Sculpting of Iterative Generative Processes"☆20Updated last year
- ☆55Updated 2 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆81Updated last year
- An Open-Ended Agentic Simulator☆36Updated 5 months ago
- ☆20Updated 3 months ago
- ☆23Updated last year
- ☆66Updated 4 months ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆13Updated last year
- ☆29Updated 3 years ago
- Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (M…☆24Updated last month
- Code for "Bayesian Structure Learning with Generative Flow Networks"☆82Updated 2 years ago
- [ICML 2024] Official implementation for "Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling".☆28Updated last month
- ☆12Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆47Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆42Updated last month
- Pytorch-like dataloaders in JAX.☆67Updated 2 months ago
- Code for our TMLR paper "Distributional GFlowNets with Quantile Flows".☆9Updated 11 months ago
- ☆18Updated last month
- Official implementation of Transformer Neural Processes☆71Updated 2 years ago
- ☆47Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆42Updated 6 months ago
- MoMo: Momentum Models for Adaptive Learning Rates☆17Updated 7 months ago
- Experiment code for "Continuous-Time Model-Based Reinforcement Learning"☆47Updated last year
- Distrax, but in equinox. Lightweight JAX library of probability distributions and bijectors.☆24Updated last month