yinizhilian / ICML2025-Papers-with-CodeLinks
历年ICML论文和开源项目合集,包含ICML2021、ICML2022、ICML2023、ICML2024、ICML2025.
☆26Updated 6 months ago
Alternatives and similar repositories for ICML2025-Papers-with-Code
Users that are interested in ICML2025-Papers-with-Code are comparing it to the libraries listed below
Sorting:
- A curated list of visual reinforcement learning resources☆373Updated 2 months ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆161Updated 2 months ago
- RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforce…☆324Updated this week
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆106Updated 6 months ago
- 历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.☆417Updated 5 months ago
- NeurIPS 2024 DACER☆138Updated 3 weeks ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆79Updated 8 months ago
- [AAAI-2025] The offical code for SiTo (Similarity-based Token Pruning for Stable Diffusion Models)☆36Updated 3 months ago
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆635Updated 4 months ago
- The official implementation of Natural Language Fine-Tuning☆53Updated 8 months ago
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆197Updated last month
- 📚 Collection of token-level model compression resources.☆155Updated last week
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆132Updated 8 months ago
- 🎉🎨 Papers, CODE, Datasets for Meta-Learning and Meta-Reinforcement-Learning☆43Updated last year
- Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.☆510Updated 9 months ago
- [EMNLP 2025 main] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆72Updated last week
- compare the theory attention gradient with PyTorch attention gradient☆15Updated last year
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆138Updated 4 months ago
- Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory☆384Updated 2 months ago
- A collection of paper/projects that trains flow matching model/policies via RL.☆231Updated last week
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".☆151Updated 3 months ago
- ☆103Updated 11 months ago
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A…☆387Updated this week
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆110Updated 2 months ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆35Updated last month
- [arXiv 2025] Efficient Reasoning Models: A Survey☆259Updated this week
- ☆22Updated 10 months ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆479Updated 11 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆180Updated last month
- An Introduction to Embodied Intelligence (A Quick Guide of Embodied-AI) (Updating)☆134Updated 4 months ago