Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆19Mar 18, 2024Updated last year
Alternatives and similar repositories for Muesli-lunarlander
Users that are interested in Muesli-lunarlander are comparing it to the libraries listed below
Sorting:
- Multi-agent Reinforcement Learning, 14th in 701 teams - NeurIPS 2024 Competition☆14Mar 13, 2025Updated 11 months ago
- AGC 5차 대회 소스 코드 저장소입니다.☆10Jun 6, 2023Updated 2 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆84Nov 19, 2022Updated 3 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆33Feb 24, 2026Updated 2 weeks ago
- The official Python library for Formulaic☆18Apr 25, 2024Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆124Feb 25, 2026Updated last week
- SLM-SQL: An Exploration of Small Language Models for Text-to-SQL☆30Aug 12, 2025Updated 6 months ago
- ☆10Oct 12, 2021Updated 4 years ago
- Self Supervised Learning for Time Series Using Similarity Distillation☆12Jun 29, 2022Updated 3 years ago
- Application and blog explaining my interpretations of In-run Data Shapley☆24Jan 30, 2025Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆16Jun 3, 2025Updated 9 months ago
- Package for preprocessing Paderborn Bearing dataset☆12Jul 9, 2025Updated 8 months ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- Code for the paper An analysis of spectral similarity measures☆11Dec 19, 2022Updated 3 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive Windowing☆10Dec 14, 2018Updated 7 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated 2 years ago
- [SIGMOD 2022] Python code for "Dimension-wise Class Activation Map for Multivariate Time Series Classification"☆19Oct 8, 2025Updated 5 months ago
- 3rd Place Solution in Lux AI Season 3 (NeurIPS 2024) Competition☆15Mar 16, 2025Updated 11 months ago
- Automatic Differentiation for Gradient Boosted Decision Trees.☆13May 17, 2022Updated 3 years ago
- ☆30Jan 22, 2026Updated last month
- ☆12Sep 7, 2024Updated last year
- Time Series Change Point Detection based on Contrastive Predictive Coding pytorch implementation☆12Oct 20, 2022Updated 3 years ago
- Repository containing article with examples of custom activation functions for Pytorch☆12Dec 9, 2019Updated 6 years ago
- LucidFlux: Caption-Free Universal Image Restoration with a Large-Scale Diffusion Transformer,you can use it in ComfyUI☆57Jan 12, 2026Updated last month
- Structured Denoising Diffusion Models in Discrete State-Spaces☆15Dec 10, 2022Updated 3 years ago
- ☆14Jul 14, 2021Updated 4 years ago
- Code accompanying "Inverse-Dirichlet Weighting Enables Reliable Training of Physics Informed Neural Networks", Maddu et al., 2021☆14Nov 3, 2021Updated 4 years ago
- [ICLR26 Oral] RealPDEBench: A Benchmark for Complex Physical Systems with Paired Real-World and Simulated Data☆55Updated this week
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- 머신러닝 프로젝트 포트폴리오 정리(Project portfolio for machine learning)☆19Jun 17, 2018Updated 7 years ago
- This repository introduces Partial Differential Equation Solver using neural network that can learn resolution-invariant solution operato…☆17Nov 23, 2021Updated 4 years ago
- Single-file pytorch implementation of hybrid-SAC☆65Jun 25, 2021Updated 4 years ago
- Unofficial minimal implementation of consistency models (CM) proposed by Song et al. 2023 on a 1D toy task in pytorch☆21May 2, 2023Updated 2 years ago
- 패스트캠퍼스 김현정 강사님의 자료구조&알고리즘 Part 1 입니다.☆15Jul 21, 2023Updated 2 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- This is the supporting website for the paper "Window Size Selection In Unsupervised Time Series Analytics: A Review and Benchmark".☆16Feb 27, 2026Updated last week
- A python implementation of the concepts in the book "Reinforcement Learning: An Introduction" by R.S. Sutton and A. G. Barto.☆19Jul 13, 2020Updated 5 years ago