iiisthu / ailab
☆36Updated 7 months ago
Alternatives and similar repositories for ailab
Users who are interested in ailab are comparing it to the repositories listed below
- ☆24Updated 2 months ago
- Run TRex with PPO☆39Updated 7 months ago
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆21Updated last year
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆186Updated 2 months ago
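- An efficient, fast arXiv paper crawler that downloads information on papers matching a specified time range, topic, and keywords to local storage, and translates their titles and abstracts into Chinese.☆163Updated last year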
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆63Updated last year
- Course Archive for AI Major in SJTU☆87Updated last year
- Paper list for Efficient Reasoning.☆768Updated last week
- ☆202Updated 4 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆926Updated 2 months ago
- ☆240Updated last year
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆530Updated last month
- llm & rl☆261Updated last month
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.☆72Updated last week
- Cool Papers - Immersive Paper Discovery☆672Updated 3 months ago
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,178Updated last month
- Training VLM agents with multi-turn reinforcement learning☆347Updated 2 weeks ago
- RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforce…☆1,722Updated this week
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆35Updated last month
- Awesome List for Agentic RL☆632Updated last week
- A collection of ICML papers and open-source projects from past years, covering ICML2021, ICML2022, ICML2023, ICML2024, and ICML2025.☆38Updated 9 months ago
- A curated list of awesome papers on Embodied AI and related research/industry-driven resources.☆490Updated 6 months ago
- SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning☆1,109Updated 2 months ago
- ☆82Updated last year
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆398Updated 5 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆319Updated 2 months ago
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆64Updated 6 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆686Updated 11 months ago
- ☆448Updated 2 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆328Updated 7 months ago