caiyuchen-ustc / Alpha-RL
On Predictability of Reinforcement Learning Dynamics for Large Language Models
☆15Updated last week
Alternatives and similar repositories for Alpha-RL
Users who are interested in Alpha-RL are comparing it to the libraries listed below
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆310Updated 8 months ago
- AIGC Creative Suite☆202Updated 4 months ago
- ☆514Updated 7 months ago
- Enhanced Benchmark Creation Tool: Automates dataset profiling, model benchmarking, and performance visualization for streamlined evaluati…☆110Updated 5 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆52Updated last year
- NEW EDU☆145Updated last week
- A Trusted Human-Multi-Agent Reinforcement Learning Interaction Framework☆503Updated 2 months ago
- An L4 innovative AGI System Empowering miRNA Drug Discovery☆329Updated 3 months ago
- ☆375Updated 11 months ago
- Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning☆153Updated last week
- Vexa is a decentralized AI agent platform built on BNB Chain.☆349Updated 6 months ago
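- A small but elegant asynchronous-handling solution for Vue3 that makes complex async logic simple and elegant and consigns repetitive boilerplate code to history☆327Updated 2 weeks ago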
- ☆213Updated 4 months ago
- ☆315Updated 6 months ago
- Open-source models for financial risk detection and fraud analytics☆333Updated 2 weeks ago
- docker-compose-starter☆110Updated 4 months ago
- A project that aims to improve LLMs' pixel reasoning ability.☆81Updated last month
- ☆356Updated last month
- Firmware for a 100W DC Electronic Load based on STM32F405 and LVGL (Keil MDK Project).☆499Updated 3 months ago
- AI Integrated Professional Document Reader☆643Updated last week
- ☆160Updated 2 months ago
- Project for the paper "Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition" (IJCAI 2025)☆82Updated 2 months ago
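- 🔬 AI academic deep research platform | LLM-driven literature analysis | One-click generation of traceable research reports | Supports literature reviews, target analysis, and competitive analysis | Word export | Chinese-English bilingual | suppr.wilddata.cn/deep-research☆188Updated 3 weeks ago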
- OmniAgent Framework is an advanced, modular AI orchestration system that transforms Web3 development by seamlessly integrating artificial…☆319Updated 8 months ago
- Rust SDK and CLI for Swarm Framework with Multi-Agent Orchestration☆145Updated 8 months ago
- ☆201Updated 3 months ago
- Revolutionizing Cancer Treatment with AI & Robotics☆65Updated 6 months ago
- ☆160Updated 3 months ago
- ☆372Updated this week
- This is a data analysis project, thanks for watching☆81Updated last month