caiyuchen-ustc/Alpha-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/caiyuchen-ustc/Alpha-RL)

caiyuchen-ustc / Alpha-RL

On Predictability of Reinforcement Learning Dynamics for Large Language Models (ICLR 2026)

☆160

Alternatives and similar repositories for Alpha-RL

Users that are interested in Alpha-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gulucaptain / DynamiCtrl
View on GitHub
[TMM'26] Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.
☆142May 23, 2025Updated last year
XYFrank103 / StegoShark
View on GitHub
StegoShark is a tool for Image & Audio Steganography and digital signatures. StegoShark allows you to securely hide and extract text or f…
☆81Aug 14, 2025Updated 10 months ago
zjukg / Enrich-on-Graph
View on GitHub
[Paper][EMNLP 2025] Enrich-on-Graph: Query-Graph Alignment for Complex Reasoning with LLM Enriching
☆35Feb 8, 2026Updated 5 months ago
oeasenet / goe
View on GitHub
Goe is a full-featured Golang application development framework inspired by many amazing frameworks. It provides a modular, interface-bas…
☆78Updated this week
Jinxhy / THEMIS
View on GitHub
[USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
☆108Aug 13, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
liufanfanlff / C3-Context-Cascade-Compression
View on GitHub
Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression
☆312Jan 27, 2026Updated 5 months ago
DEFENSE-SEU / RobustFlow
View on GitHub
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
☆238Oct 19, 2025Updated 8 months ago
Tencent-Hunyuan / MixGRPO
View on GitHub
[ECCV 2026] MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
☆1,147Jul 1, 2026Updated last week
zhangyulin-space / ChatFerry
View on GitHub
☆104Oct 8, 2025Updated 9 months ago
UCSB-AI / Mojito
View on GitHub
Official repo for the paper "Mojito: Motion Trajectory and Intensity Control for Video Generation""
☆62May 12, 2026Updated last month
FxPool / FXMinerProxy
View on GitHub
🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用
☆3,707May 22, 2026Updated last month
hyang-cyber / Dolus
View on GitHub
Building a 5GHz WiFi Spoofer with BW16 ( Realtek RTL8720dn)
☆130Apr 20, 2026Updated 2 months ago
ECNU-SII / Continual-NExT
View on GitHub
☆235Jun 27, 2026Updated last week
curryqka / AgentThink
View on GitHub
[EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!
☆147Sep 27, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
YUHAO-corn / manufacturing-agents
View on GitHub
Multi-agent LLM system for intelligent replenishment decisions in manufacturing supply chains
☆166Mar 13, 2026Updated 3 months ago
YOUNG-bit / OpenGS-Fusion
View on GitHub
[IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding
☆76Aug 2, 2025Updated 11 months ago
ServerlessOS / Fuyao
View on GitHub
Source code of Fuyao, built on Nightcore
☆17Mar 8, 2024Updated 2 years ago
gwh22 / UniVoice
View on GitHub
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
☆115Oct 30, 2025Updated 8 months ago
grenoble-zhang / Proteus-ID
View on GitHub
Proteus-ID: ID-Consistent and Motion-Coherent Video Customization [Siggraph Asia 2025]
☆70Jun 24, 2026Updated 2 weeks ago
MarkLee131 / PoC-Research-Papers
View on GitHub
Research papers on Proot-of-Concepts
☆114Feb 3, 2026Updated 5 months ago
mzmm403 / EasyCollectiveUI
View on GitHub
A comprehensive component library for vue
☆51Mar 18, 2025Updated last year
bcmi / Composite-Image-Evaluation
View on GitHub
☆24Feb 19, 2026Updated 4 months ago
EvilGenius-dot / RustMinerSystem
View on GitHub
💰唯一正版💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy 矿池抽水矿池代理矿池中转矿池抽…
☆3,873Jul 1, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
YOUNG-bit / open_semantic_slam
View on GitHub
ICRA2025: OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding
☆305Mar 27, 2025Updated last year
Uderwood-TZ / LSTM-PINN-and-PINN-for-population-forecasting
View on GitHub
LSTM-PINN and PINN for population forecasting
☆39May 9, 2025Updated last year
JingyuanXu / ucfaceconbainall
View on GitHub
Unified Semantic Curation Face (USCFace): An RDF Curation & Visualization System
☆38Jul 18, 2025Updated 11 months ago
Docta-ai / docta
View on GitHub
A Doctor for your data
☆3,481Jun 16, 2026Updated 3 weeks ago
Klavis-AI / klavis
View on GitHub
Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale
☆5,766Jun 1, 2026Updated last month
DeepWism / DeepWism-miRNA
View on GitHub
A L4 innovative AGI System Empowering miRNA Drug Discovery
☆329Jul 1, 2025Updated last year
Everlyn-Labs / Everlyn-1
View on GitHub
The first open autoregressive foundational video AI model.
☆2,892Oct 14, 2024Updated last year
TJU-DRL-LAB / AI-Optimizer
View on GitHub
The next generation deep reinforcement learning tookit
☆3,464Jun 16, 2023Updated 3 years ago
IAAR-Shanghai / QAEncoder
View on GitHub
[ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems
☆176Jul 12, 2025Updated 11 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
sinberCS / switch2ai
View on GitHub
switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…
☆173Nov 11, 2025Updated 7 months ago
WuKongOpenSource / WukongCRM-11.0-JAVA
View on GitHub
悟空CRM-基于Spring Cloud Alibaba微服务架构 +vue ElementUI的前后端分离CRM系统
☆2,430Aug 27, 2021Updated 4 years ago
isoftstone-data-intelligence-ai / efflux-backend
View on GitHub
☆722Jun 19, 2025Updated last year
Tanglumy / Finance-Bro
View on GitHub
your finance bro Agent for trading and investing
☆111Nov 8, 2025Updated 8 months ago
JamesLLMs / LDM
View on GitHub
A Lightweight Learning Framework for Dexterous Manipulation
☆295Feb 26, 2026Updated 4 months ago
photon-hq / rapid-ai-dev
View on GitHub
Toolkit for rapid AI and agent prototyping
☆148Mar 6, 2026Updated 4 months ago
jiaweizzhao / InRank
View on GitHub
☆153Jan 2, 2024Updated 2 years ago