Itomigna2 / Muesli-lunarlanderView external linksLinks
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆19Mar 18, 2024Updated last year
Alternatives and similar repositories for Muesli-lunarlander
Users that are interested in Muesli-lunarlander are comparing it to the libraries listed below
Sorting:
- AGC 5차 대회 소스 코드 저장소입니다.☆10Jun 6, 2023Updated 2 years ago
- ☆14Mar 5, 2024Updated last year
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆121Updated this week
- Novelty Detection with Reconstruction along Projection Pathway☆10May 10, 2021Updated 4 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 4 months ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- Official code implementation for the ACL 2024 Student Research Workshop paper "In-Context Symbolic Regression: Leveraging Large Language …☆17Sep 26, 2024Updated last year
- ☆15Nov 20, 2023Updated 2 years ago
- SLM-SQL: An Exploration of Small Language Models for Text-to-SQL☆29Aug 12, 2025Updated 6 months ago
- Self Supervised Learning for Time Series Using Similarity Distillation☆12Jun 29, 2022Updated 3 years ago
- Code for the paper An analysis of spectral similarity measures☆11Dec 19, 2022Updated 3 years ago
- ☆10Oct 12, 2021Updated 4 years ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆15Jun 3, 2025Updated 8 months ago
- RealPDEBench: A Benchmark for Complex Physical Systems with Paired Real-World and Simulated Data☆34Updated this week
- Princeton University - COS/ECE 473 : Elements of Decentralized Finance☆11Apr 12, 2023Updated 2 years ago
- [SIGMOD 2022] Python code for "Dimension-wise Class Activation Map for Multivariate Time Series Classification"☆19Oct 8, 2025Updated 4 months ago
- 3rd Place Solution in Lux AI Season 3 (NeurIPS 2024) Competition☆15Mar 16, 2025Updated 11 months ago
- Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive Windowing☆10Dec 14, 2018Updated 7 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated last year
- GPGPU array on Vulkan☆17Jun 3, 2023Updated 2 years ago
- Time Series Change Point Detection based on Contrastive Predictive Coding pytorch implementation☆12Oct 20, 2022Updated 3 years ago
- Automatic Differentiation for Gradient Boosted Decision Trees.☆13May 17, 2022Updated 3 years ago
- ☆30Jan 22, 2026Updated 3 weeks ago
- Structured Denoising Diffusion Models in Discrete State-Spaces☆15Dec 10, 2022Updated 3 years ago
- Spectral methods in matlab☆12Mar 27, 2025Updated 10 months ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- ☆16Aug 6, 2024Updated last year
- ☆14Jul 14, 2021Updated 4 years ago
- Code accompanying "Inverse-Dirichlet Weighting Enables Reliable Training of Physics Informed Neural Networks", Maddu et al., 2021☆14Nov 3, 2021Updated 4 years ago
- [ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs!☆22Jan 26, 2026Updated 2 weeks ago
- LocalStack website☆12Nov 21, 2023Updated 2 years ago
- This is the supporting website for the paper "Window Size Selection In Unsupervised Time Series Analytics: A Review and Benchmark".☆16Nov 3, 2023Updated 2 years ago
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- 패스트캠퍼스 김현정 강사님의 자료구조&알고리즘 Part 1 입니다.☆15Jul 21, 2023Updated 2 years ago
- Unofficial minimal implementation of consistency models (CM) proposed by Song et al. 2023 on a 1D toy task in pytorch☆21May 2, 2023Updated 2 years ago
- ☆19Apr 16, 2025Updated 9 months ago
- ☆21Feb 22, 2025Updated 11 months ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- ☆21Apr 3, 2024Updated last year