Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆19Mar 18, 2024Updated 2 years ago
Alternatives and similar repositories for Muesli-lunarlander
Users that are interested in Muesli-lunarlander are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Nov 4, 2021Updated 4 years ago
- ☆12Sep 7, 2024Updated last year
- Multi-agent Reinforcement Learning, 14th in 701 teams - NeurIPS 2024 Competition☆14Mar 13, 2025Updated last year
- ☆28Dec 15, 2025Updated 4 months ago
- A scaffold to speed up launching a flask project.☆15Apr 8, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024☆26Apr 7, 2024Updated 2 years ago
- Image-based gridworld experiment for learning Markov state abstractions☆21Sep 16, 2024Updated last year
- Implementation of Vision Transformers in Flax☆18Oct 12, 2020Updated 5 years ago
- 3rd Place Solution in Lux AI Season 3 (NeurIPS 2024) Competition☆15Mar 16, 2025Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- ☆30Jan 22, 2026Updated 2 months ago
- This extension will check web content and convert to Unicode encoded text if they are Zawgyi.☆22Oct 5, 2020Updated 5 years ago
- ☆22Mar 17, 2025Updated last year
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆14Mar 5, 2024Updated 2 years ago
- Self Supervised Learning for Time Series Using Similarity Distillation☆12Jun 29, 2022Updated 3 years ago
- A feature-enhanced TypeScript drop-in replacement for a very popular and simple-to-use debug module.☆34Jul 7, 2025Updated 9 months ago
- Automatic Differentiation for Gradient Boosted Decision Trees.☆13May 17, 2022Updated 3 years ago
- ComfyUI custom nodes for AudioX — generate sound effects and background music from video, powered by HKUSTAudio/AudioX.☆36Mar 12, 2026Updated last month
- Official code implementation for the ACL 2024 Student Research Workshop paper "In-Context Symbolic Regression: Leveraging Large Language …☆18Sep 26, 2024Updated last year
- Structured Denoising Diffusion Models in Discrete State-Spaces☆15Dec 10, 2022Updated 3 years ago
- AGC 5차 대회 소스 코드 저장소입니다.☆10Jun 6, 2023Updated 2 years ago
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆40Mar 7, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆35Jun 28, 2024Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 6 months ago
- Learning High-Quality and General-Purpose Phrase Representations. Findings of EACL 2024☆16Feb 29, 2024Updated 2 years ago
- ☆34Mar 3, 2025Updated last year
- ☆10Oct 12, 2021Updated 4 years ago
- ☆19Apr 16, 2025Updated last year
- GPGPU array on Vulkan☆17Jun 3, 2023Updated 2 years ago
- [ICLR26 Oral] RealPDEBench: A Benchmark for Complex Physical Systems with Paired Real-World and Simulated Data☆73Mar 8, 2026Updated last month
- Code for the paper An analysis of spectral similarity measures☆11Dec 19, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Application and blog explaining my interpretations of In-run Data Shapley☆31Jan 30, 2025Updated last year
- Novelty Detection with Reconstruction along Projection Pathway☆10May 10, 2021Updated 4 years ago
- Implementation of the "Sim-to-Real Transfer of Robotic Control with Dynamics Randomization" paper☆13Sep 8, 2021Updated 4 years ago
- ☆18Aug 3, 2022Updated 3 years ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- SLM-SQL: An Exploration of Small Language Models for Text-to-SQL☆32Aug 12, 2025Updated 8 months ago
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago