iiisthu / ailab
☆36Updated 6 months ago
Alternatives and similar repositories for ailab
Users that are interested in ailab are comparing it to the libraries listed below
- ☆24Updated last month
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆183Updated last month
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.☆58Updated last week
- Run TRex with PPO☆39Updated 6 months ago
- Use clash on Linux without sudo privileges☆163Updated last year
- ☆197Updated 4 months ago
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆21Updated last year
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆61Updated last year
- A collection of ICML papers and open-source projects over the years, covering ICML2021, ICML2022, ICML2023, ICML2024, and ICML2025.☆35Updated 8 months ago
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆33Updated 2 weeks ago
- Training VLM agents with multi-turn reinforcement learning☆324Updated 3 weeks ago
- Course Archive for AI Major in SJTU☆86Updated last year
- llm & rl☆254Updated last month
- Awesome List for Agentic RL☆553Updated this week
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆895Updated 2 months ago
- A reconstruction of RL Introduction and its course materials for a more efficient entry☆16Updated 6 months ago
- ☆230Updated last year
- ☆311Updated 6 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆400Updated 11 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆373Updated last month
- ICLR 2025 Agent-Related Papers☆72Updated last year
- A Telegram bot to recommend arXiv papers☆289Updated 2 weeks ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆387Updated 4 months ago
- Tsinghua Cloud (清华大学云盘) batch download helper, for cases where a shared file is too large to download directly; this script adds several more useful small features☆225Updated last year
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 7 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆290Updated last year
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆317Updated last month
- ☆21Updated 4 months ago
- ☆440Updated 9 months ago
- A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆382Updated last year