caiyuchen-ustc / Alpha-RLLinks
On Predictability of Reinforcement Learning Dynamics for Large Language Models
☆21Updated last month
Alternatives and similar repositories for Alpha-RL
Users that are interested in Alpha-RL are comparing it to the libraries listed below
Sorting:
- Enhanced Benchmark Creation Tool: Automates dataset profiling, model benchmarking, and performance visualization for streamlined evaluati…☆110Updated 5 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆310Updated 9 months ago
- AIGC Creative Suite☆202Updated 5 months ago
- ☆422Updated 4 months ago
- A Trusted Human-Multi-Agent Reinforcement Learning Interaction Framework☆503Updated 2 weeks ago
- ☆315Updated 7 months ago
- Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning☆158Updated 2 weeks ago
- ☆514Updated 8 months ago
- A L4 innovative AGI System Empowering miRNA Drug Discovery☆329Updated 4 months ago
- Firmware for a 100W DC Electronic Load based on STM32F405 and LVGL (Keil MDK Project).☆499Updated 4 months ago
- ☆482Updated last year
- ☆559Updated this week
- ☆213Updated 5 months ago
- Open-source models for financial risk detection and fraud analytics☆386Updated last week
- 日历软件重写☆452Updated 7 months ago
- NEW EDU☆201Updated 3 weeks ago
- PVPAI LLM 🔥The First Open-Source DeFAI Large Language Model Powered by DeepSeek.☆302Updated 9 months ago
- ☆414Updated 4 months ago
- docker-compose-starter☆110Updated 4 months ago
- OmniAgent Framework is an advanced, modular AI orchestration system that transforms Web3 development by seamlessly integrating artificial…☆319Updated 9 months ago
- 小而美的Vue3异步处理解决方案,让复杂的异步逻辑变得简单优雅,让重复的样板代码成为历史☆480Updated last month
- ☆535Updated last month
- Welcome to BlockSeek's official documentation. BlockSeek combines state-of-the-art AI with blockchain technology to revolutionize cryptoc…☆310Updated 8 months ago
- Vexa is a decentralized AI agent platform built on BNB Chain.☆349Updated 7 months ago
- ☆207Updated 7 months ago
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems☆176Updated 3 months ago
- AI Integrated Professional Document Reader☆648Updated this week
- The 1st dynamic phishing kit dataset☆201Updated 8 months ago
- ☆130Updated 4 months ago
- Joint Semantic Detection and Dissemination Control of Phishing Attacks on Social Media via LLama- Based Modeling☆474Updated 2 weeks ago