Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆20 · Mar 18, 2024 · Updated 2 years ago
Alternatives and similar repositories for Muesli-lunarlander
Users that are interested in Muesli-lunarlander are comparing it to the libraries listed below.
- ☆18 · Nov 4, 2021 · Updated 4 years ago
- A scaffold to speed up launching a flask project. ☆15 · Jun 3, 2026 · Updated 3 weeks ago
- Thinker project ☆16 · Sep 4, 2024 · Updated last year
- ☆10 · Dec 3, 2022 · Updated 3 years ago
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024 ☆27 · Apr 26, 2026 · Updated 2 months ago
- Image-based gridworld experiment for learning Markov state abstractions ☆20 · Sep 16, 2024 · Updated last year
- Implementation of Vision Transformers in Flax ☆18 · Oct 12, 2020 · Updated 5 years ago
- 3rd Place Solution in Lux AI Season 3 (NeurIPS 2024) Competition ☆14 · Mar 16, 2025 · Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆84 · Nov 19, 2022 · Updated 3 years ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? ☆19 · Jun 3, 2025 · Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework ☆133 · May 9, 2026 · Updated last month
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆23 · Dec 22, 2020 · Updated 5 years ago
- ☆30 · Apr 29, 2026 · Updated 2 months ago
- This extension checks web content and converts it to Unicode-encoded text if it is Zawgyi. ☆22 · Oct 5, 2020 · Updated 5 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning" ☆12 · Nov 22, 2022 · Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO ☆16 · May 19, 2023 · Updated 3 years ago
- [CVPR 2026] 3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image ☆82 · Jun 22, 2026 · Updated last week
- ☆25 · Mar 17, 2025 · Updated last year
- Automatic Differentiation for Gradient Boosted Decision Trees. ☆13 · May 17, 2022 · Updated 4 years ago
- Official code implementation for the ACL 2024 Student Research Workshop paper "In-Context Symbolic Regression: Leveraging Large Language … ☆18 · Sep 26, 2024 · Updated last year
- ComfyUI custom nodes for AudioX: generate sound effects and background music from video, powered by HKUSTAudio/AudioX. ☆38 · Mar 12, 2026 · Updated 3 months ago
- A node created for identity consistency with FLUX.2 klein 9b models in ComfyUI. ☆60 · May 13, 2026 · Updated last month
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research. ☆51 · Mar 8, 2024 · Updated 2 years ago
- Source code repository for the 5th AGC competition. ☆10 · Jun 6, 2023 · Updated 3 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards ☆36 · Oct 3, 2025 · Updated 8 months ago
- ☆34 · Mar 3, 2025 · Updated last year
- ☆16 · Nov 20, 2023 · Updated 2 years ago
- ☆10 · Oct 12, 2021 · Updated 4 years ago
- Princeton University - COS/ECE 473: Elements of Decentralized Finance ☆11 · Apr 12, 2023 · Updated 3 years ago
- Code for the paper "An analysis of spectral similarity measures" ☆11 · Dec 19, 2022 · Updated 3 years ago
- Application and blog explaining my interpretations of In-run Data Shapley ☆31 · Jan 30, 2025 · Updated last year
- Implementation of the "Sim-to-Real Transfer of Robotic Control with Dynamics Randomization" paper ☆13 · Sep 8, 2021 · Updated 4 years ago
- Novelty Detection with Reconstruction along Projection Pathway ☆10 · May 10, 2021 · Updated 5 years ago
- MegaStyle, a scalable style data generation framework for consistency and diversity ☆129 · Apr 23, 2026 · Updated 2 months ago
- Whitening for Self-Supervised Representation Learning | Official repository ☆137 · Feb 5, 2023 · Updated 3 years ago
- SWI Prolog library to interface to the GPT API ☆20 · Mar 6, 2024 · Updated 2 years ago
- ☆18 · Aug 3, 2022 · Updated 3 years ago
- Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive Windowing ☆10 · Dec 14, 2018 · Updated 7 years ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs ☆29 · Dec 17, 2024 · Updated last year