Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆19Mar 18, 2024Updated 2 years ago
Alternatives and similar repositories for Muesli-lunarlander
Users that are interested in Muesli-lunarlander are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Nov 4, 2021Updated 4 years ago
- ☆12Sep 7, 2024Updated last year
- Multi-agent Reinforcement Learning, 14th in 701 teams - NeurIPS 2024 Competition☆14Mar 13, 2025Updated last year
- ☆28Dec 15, 2025Updated 5 months ago
- A scaffold to speed up launching a flask project.☆15Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Thinker project☆16Sep 4, 2024Updated last year
- ☆10Dec 3, 2022Updated 3 years ago
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024☆27Apr 26, 2026Updated last month
- Image-based gridworld experiment for learning Markov state abstractions☆20Sep 16, 2024Updated last year
- 3rd Place Solution in Lux AI Season 3 (NeurIPS 2024) Competition☆14Mar 16, 2025Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆84Nov 19, 2022Updated 3 years ago
- ☆21Feb 22, 2025Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆129May 9, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- ☆30Apr 29, 2026Updated last month
- ☆10Dec 10, 2024Updated last year
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆16May 19, 2023Updated 3 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- ☆24Mar 17, 2025Updated last year
- Implementation of UltraMem, improved Product Key Memory design, from Bytedance AI labs☆28Nov 4, 2025Updated 7 months ago
- A feature-enhanced TypeScript drop-in replacement for a very popular and simple-to-use debug module.☆34May 29, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Automatic Differentiation for Gradient Boosted Decision Trees.☆13May 17, 2022Updated 4 years ago
- The node has been created with an objective of identity consistency for FLUX.2 klein 9b models in ComfyUI.☆59May 13, 2026Updated 3 weeks ago
- Structured Denoising Diffusion Models in Discrete State-Spaces☆15Dec 10, 2022Updated 3 years ago
- A PyTorch implementation of DeepMind's MuZero agent☆37Dec 1, 2023Updated 2 years ago
- AGC 5차 대회 소스 코드 저장소입니다.☆10Jun 6, 2023Updated 3 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆36Jun 28, 2024Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 8 months ago
- ☆34Mar 3, 2025Updated last year
- Learning High-Quality and General-Purpose Phrase Representations. Findings of EACL 2024☆16Feb 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆56May 31, 2026Updated last week
- ☆19Apr 16, 2025Updated last year
- GPGPU array on Vulkan☆17Jun 3, 2023Updated 3 years ago
- Princeton University - COS/ECE 473 : Elements of Decentralized Finance☆11Apr 12, 2023Updated 3 years ago
- Code for the paper An analysis of spectral similarity measures☆11Dec 19, 2022Updated 3 years ago
- Application and blog explaining my interpretations of In-run Data Shapley☆31Jan 30, 2025Updated last year
- Implementation of the "Sim-to-Real Transfer of Robotic Control with Dynamics Randomization" paper☆13Sep 8, 2021Updated 4 years ago