iiisthu / ailab
☆36Updated 8 months ago
Alternatives and similar repositories for ailab
Users that are interested in ailab are comparing it to the libraries listed below
- ☆24Updated 3 months ago
- Run TRex with PPO☆39Updated 8 months ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆192Updated 3 months ago
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆21Updated 2 years ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆63Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆406Updated 3 months ago
- Training VLM agents with multi-turn reinforcement learning☆381Updated this week
- ☆102Updated last week
- Use clash on Linux without sudo privileges☆173Updated last year
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆35Updated 2 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆405Updated last year
- A collection of ICML papers and open-source projects over the years, covering ICML2021, ICML2022, ICML2023, ICML2024, and ICML2025.☆41Updated 10 months ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆539Updated 2 months ago
- Automated tool for running Python programs in a streamlined manner☆339Updated 2 weeks ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆414Updated 6 months ago
- ☆213Updated 6 months ago
- siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems☆330Updated this week
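- An efficient, fast arXiv paper crawler that downloads information on papers matching a specified time range, topic, and keywords to local storage, and translates their titles and abstracts into Chinese.☆171Updated last year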
- Homework for robot learning basics.☆11Updated 2 years ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆62Updated last month
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆281Updated 11 months ago
- Awesome List for Agentic RL☆738Updated last month
- ☆412Updated 11 months ago
- A paper list of Awesome Latent Space.☆305Updated last week
- Paper list for Efficient Reasoning.☆806Updated last week
- An AI teaching assistant system based on the content of the textbook "Mathematics in AI" (《AI 中的数学》)☆19Updated 8 months ago
- Open Platform for Embodied Agents☆339Updated last year
- ICLR 2025 Agent-Related Papers☆75Updated last year
- Course Archive for AI Major in SJTU☆88Updated last year
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆330Updated 9 months ago