Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆19Mar 18, 2024Updated last year
Alternatives and similar repositories for Muesli-lunarlander
Users that are interested in Muesli-lunarlander are comparing it to the libraries listed below
Sorting:
- ☆18Nov 4, 2021Updated 4 years ago
- Multi-agent Reinforcement Learning, 14th in 701 teams - NeurIPS 2024 Competition☆14Mar 13, 2025Updated 11 months ago
- ☆14Mar 5, 2024Updated 2 years ago
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆33Feb 24, 2026Updated last week
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆122Feb 25, 2026Updated last week
- Code for the paper An analysis of spectral similarity measures☆11Dec 19, 2022Updated 3 years ago
- Novelty Detection with Reconstruction along Projection Pathway☆10May 10, 2021Updated 4 years ago
- SLM-SQL: An Exploration of Small Language Models for Text-to-SQL☆30Aug 12, 2025Updated 6 months ago
- Application and blog explaining my interpretations of In-run Data Shapley☆24Jan 30, 2025Updated last year
- A scaffold to speed up launching a flask project.☆15Jan 19, 2026Updated last month
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated 2 years ago
- 3rd Place Solution in Lux AI Season 3 (NeurIPS 2024) Competition☆15Mar 16, 2025Updated 11 months ago
- [ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference☆64Updated this week
- [SIGMOD 2022] Python code for "Dimension-wise Class Activation Map for Multivariate Time Series Classification"☆19Oct 8, 2025Updated 5 months ago
- A hydra integrated template for LG-Dacon Competetion☆13Jul 20, 2021Updated 4 years ago
- ☆30Jan 22, 2026Updated last month
- LucidFlux: Caption-Free Universal Image Restoration with a Large-Scale Diffusion Transformer,you can use it in ComfyUI☆57Jan 12, 2026Updated last month
- Automatic Differentiation for Gradient Boosted Decision Trees.☆13May 17, 2022Updated 3 years ago
- Learning High-Quality and General-Purpose Phrase Representations. Findings of EACL 2024☆16Feb 29, 2024Updated 2 years ago
- Code accompanying "Inverse-Dirichlet Weighting Enables Reliable Training of Physics Informed Neural Networks", Maddu et al., 2021☆14Nov 3, 2021Updated 4 years ago
- [ICLR26 Oral] RealPDEBench: A Benchmark for Complex Physical Systems with Paired Real-World and Simulated Data☆43Feb 23, 2026Updated last week
- ☆16Aug 6, 2024Updated last year
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- Implementation of the "Sim-to-Real Transfer of Robotic Control with Dynamics Randomization" paper☆13Sep 8, 2021Updated 4 years ago
- Structured Denoising Diffusion Models in Discrete State-Spaces☆15Dec 10, 2022Updated 3 years ago
- LocalStack website☆12Nov 21, 2023Updated 2 years ago
- This repository introduces Partial Differential Equation Solver using neural network that can learn resolution-invariant solution operato…☆17Nov 23, 2021Updated 4 years ago
- [NeurIPS 2023] Official implementation of "Learning from Visual Observation via Offline Pretrained State-to-Go Transformer"☆18Oct 1, 2023Updated 2 years ago
- Single-file pytorch implementation of hybrid-SAC☆65Jun 25, 2021Updated 4 years ago
- This is the supporting website for the paper "Window Size Selection In Unsupervised Time Series Analytics: A Review and Benchmark".☆16Feb 27, 2026Updated last week
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- ☆21Feb 22, 2025Updated last year
- [ICCVW 2023] TIFace: Improving Facial Reconstruction through Tensorial Radiance Fields and Implicit Surfaces. 1st place at VSCHH @ ICCV 2…☆18Dec 20, 2023Updated 2 years ago
- ☆21Apr 3, 2024Updated last year
- ☆17Mar 19, 2022Updated 3 years ago
- Contextualized per-token embeddings☆34May 11, 2025Updated 9 months ago
- [ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs!☆24Jan 26, 2026Updated last month
- Code for the paper: "Supervised contrastive learning over prototype-label embeddings for network intrusion detection"☆15Jun 7, 2021Updated 4 years ago