caiyuchen-ustc / Alpha-RLView external linksLinks
On Predictability of Reinforcement Learning Dynamics for Large Language Models (ICLR 2026)
☆73Jan 27, 2026Updated 2 weeks ago
Alternatives and similar repositories for Alpha-RL
Users that are interested in Alpha-RL are comparing it to the libraries listed below
Sorting:
- Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.☆140May 23, 2025Updated 8 months ago
- ☆223Nov 5, 2025Updated 3 months ago
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!☆138Sep 27, 2025Updated 4 months ago
- ☆156Jul 25, 2025Updated 6 months ago
- LSTM-PINN and PINN for population forecasting☆34May 9, 2025Updated 9 months ago
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding☆75Aug 2, 2025Updated 6 months ago
- Unified Semantic Curation Face (USCFace): An RDF Curation & Visualization System☆38Jul 18, 2025Updated 6 months ago
- MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE☆1,101Feb 4, 2026Updated last week
- Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression☆286Jan 27, 2026Updated 2 weeks ago
- your finance bro Agent for trading and investing☆108Nov 8, 2025Updated 3 months ago
- Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"☆232Oct 19, 2025Updated 3 months ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- ☆24Feb 14, 2024Updated 2 years ago
- Two languages, one purpose: turning words into geometry.☆160Dec 31, 2025Updated last month
- [USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models☆108Aug 13, 2025Updated 6 months ago
- ☆117Aug 29, 2025Updated 5 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- 一款开源的静态代码扫描工具 - 服务端☆135Nov 6, 2025Updated 3 months ago
- ☆89Jan 28, 2026Updated 2 weeks ago
- ☆87Feb 3, 2026Updated last week
- ☆38Apr 21, 2025Updated 9 months ago
- Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training☆51Jul 28, 2025Updated 6 months ago
- 🐱 PawHaven — an open-source platform that helps volunteers, shelters, and adopters report, track, and share stray animal rescue cases (f…☆88Updated this week
- A Lightweight Learning Framework for Dexterous Manipulation☆207Feb 6, 2026Updated last week
- Research papers on Proot-of-Concepts☆76Feb 3, 2026Updated last week
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆24Updated this week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆22Nov 13, 2025Updated 3 months ago
- ☆154Jan 2, 2024Updated 2 years ago
- ICRA2025: OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding☆275Mar 27, 2025Updated 10 months ago
- "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"☆587Nov 1, 2025Updated 3 months ago
- INFTY Engine: An Optimization Toolkit to Support Continual AI☆567Sep 13, 2025Updated 5 months ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- MCP server for Grok AI API integration☆19Jun 2, 2025Updated 8 months ago
- ☆13Oct 21, 2024Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated last month
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- ☆28Feb 3, 2026Updated last week
- Intelligent memory system for OpenWebUI with semantic retrieval, LLM consolidation, and adaptive context injection☆44Dec 2, 2025Updated 2 months ago
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆10Oct 6, 2022Updated 3 years ago