enajx / NDP
☆48 Updated 3 months ago
Related projects:
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆73 Updated 2 months ago
- ☆43 Updated 2 months ago
- ☆34 Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆82 Updated 9 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆117 Updated 10 months ago
- Efficient baselines for autocurricula in JAX. ☆165 Updated 3 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆87 Updated last month
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference" ☆35 Updated 2 months ago
- ☆59 Updated last month
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆59 Updated last month
- Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation ☆117 Updated last month
- ☆65 Updated 2 months ago
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. ☆206 Updated last week
- ☆56 Updated last month
- ☆141 Updated 2 weeks ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch ☆30 Updated this week
- Repository for "Toward Artificial Open-Ended Evolution within Lenia using Quality-Diversity" (ALIFE 2024). ☆13 Updated 2 months ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable. ☆159 Updated last year
- ☆17 Updated 3 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code ☆20 Updated 3 weeks ago
- ☆39 Updated 3 months ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses". ☆78 Updated 6 months ago
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024. ☆25 Updated 7 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated last month
- ☆28 Updated last week
- In Progress Implementation of GATO style Generalist Multimodal model capable of image, text, RL and Robotics tasks ☆43 Updated 3 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆74 Updated 7 months ago
- Visualizations of the theory behind diffusion models. ☆63 Updated 5 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆46 Updated 3 months ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ☆55 Updated last year