iiisthu / ailabLinks
☆31Updated 4 months ago
Alternatives and similar repositories for ailab
Users that are interested in ailab are comparing it to the libraries listed below
Sorting:
- ☆22Updated 3 months ago
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆32Updated last month
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆61Updated last year
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆165Updated 3 months ago
- ☆21Updated 2 months ago
- ☆183Updated 2 months ago
- RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforce…☆428Updated this week
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆304Updated last week
- Run TRex with PPO☆39Updated 4 months ago
- ☆211Updated last year
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆134Updated last year
- 在没有sudo权限的情况下,在linux上使用clash☆145Updated 10 months ago
- ICLR 2025 Agent-Related Papers☆74Updated 10 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆205Updated 5 months ago
- An Awesome List of Agentic Model trained with Reinforcement Learning☆476Updated last week
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆815Updated last month
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆283Updated 10 months ago
- Open Platform for Embodied Agents☆329Updated 8 months ago
- Paper list for Efficient Reasoning.☆664Updated last week
- A Survey of Reinforcement Learning for Large Reasoning Models☆1,223Updated this week
- ☆218Updated last week
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆317Updated 5 months ago
- llm & rl☆218Updated last week
- ☆289Updated 4 months ago
- 清华大学云盘 (Tsinghua Cloud) 批量下载助手,适用于分享的文件 size 过大导致无法直接下载的情况,本脚本添加了更多实用的小功能☆216Updated 11 months ago
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆20Updated last year
- ☆400Updated 3 weeks ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆296Updated this week
- ☆420Updated 7 months ago
- AutoDL平台服务器适配梯子, 使用 Clash 作为代理工具☆456Updated 2 months ago