Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"
☆25May 5, 2024Updated 2 years ago
Alternatives and similar repositories for s2pg
Users that are interested in s2pg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jul 16, 2024Updated last year
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆36Jul 8, 2025Updated 11 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆35May 18, 2026Updated 3 weeks ago
- Deep memory and sequence models in JAX☆28Jun 1, 2026Updated last week
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆29Jan 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for my master thesis on hierarchical probabilistic forecasting of smart meter time series using weather input.☆11Aug 21, 2022Updated 3 years ago
- Partially Observable Multi-Agent RL with Transformers☆17Apr 27, 2026Updated last month
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆32Oct 12, 2023Updated 2 years ago
- ☆15Aug 8, 2022Updated 3 years ago
- ☆10Jun 27, 2024Updated last year
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 7 years ago
- A digital data-generation pipeline that synthesizes humanoid loco-manipulation data from 3D assets and video priors.☆249Jun 4, 2026Updated last week
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- Reinforcement Learning inside a 3D soccer simulation☆38Sep 15, 2024Updated last year
- [ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…☆18Jun 12, 2025Updated 11 months ago
- Pointax: PointMaze Environment for JAX☆28Oct 22, 2025Updated 7 months ago
- Code accompanying the latent-action-priors paper.☆12Mar 5, 2025Updated last year
- ☆24May 20, 2025Updated last year
- Code for "Baba Is AI: Break the Rules to Beat the Benchmark"☆47Sep 3, 2025Updated 9 months ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆30Mar 1, 2024Updated 2 years ago
- ☆12Apr 1, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆78Dec 26, 2024Updated last year
- ☆19Apr 22, 2024Updated 2 years ago
- ☆46Jul 12, 2024Updated last year
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆22Oct 21, 2025Updated 7 months ago
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆35Sep 18, 2024Updated last year
- Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continu…☆14Jul 3, 2020Updated 5 years ago
- ☆19Jul 4, 2025Updated 11 months ago
- ☆17Aug 20, 2025Updated 9 months ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆52Dec 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆96Jun 4, 2024Updated 2 years ago
- Train, visualize, and evaluate RL policies for the Terra environment.☆20May 22, 2026Updated 2 weeks ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Official Implementation of SFM and the baselines in Jax.☆21May 31, 2025Updated last year
- ☆21May 5, 2026Updated last month
- Official implementation of HEAD CoRL 2025☆26Aug 22, 2025Updated 9 months ago
- High-performance JAX-powered simulator for robotic navigation in 2D mazes, optimized for Quality-Diversity algorithm research and benchma…☆20Jun 19, 2025Updated 11 months ago