ZJU-REAL/TimeHC-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZJU-REAL/TimeHC-RL)

ZJU-REAL / TimeHC-RL

This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).

☆48

Alternatives and similar repositories for TimeHC-RL

Users that are interested in TimeHC-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZJU-REAL / VerifyBench
View on GitHub
[ICLR 2026] VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models
☆21Feb 18, 2026Updated 5 months ago
ZJU-REAL / Self-Braking-Tuning
View on GitHub
[NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604
☆54Nov 4, 2025Updated 8 months ago
ZJU-REAL / Mind-the-Gap
View on GitHub
[NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684
☆47Oct 20, 2025Updated 9 months ago
ZJU-REAL / LAPO
View on GitHub
☆37Oct 9, 2025Updated 9 months ago
ZJU-REAL / SVGenius
View on GitHub
[ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139
☆78Nov 10, 2025Updated 8 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ZJU-REAL / ViewSpatial-Bench
View on GitHub
[ECCV 2026] ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models
☆82Mar 9, 2026Updated 4 months ago
ZJU-REAL / GSM8K-V
View on GitHub
GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts
☆40Sep 30, 2025Updated 9 months ago
ZJU-REAL / HBPO
View on GitHub
☆34Aug 11, 2025Updated 11 months ago
ZJU-REAL / CoVerRL
View on GitHub
[ACL 2026 main] CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution
☆27Apr 18, 2026Updated 3 months ago
ZJU-REAL / cooper
View on GitHub
☆29Aug 19, 2025Updated 11 months ago
ZJU-REAL / BEACON
View on GitHub
[ICML 2026] Milestone-Guided Policy Learning for Long-Horizon Language Agents
☆37May 29, 2026Updated last month
ZJU-REAL / GRIL
View on GitHub
[ACL 2026 findings] Pause or Fabricate? Training Language Models for Grounded Reasoning
☆25Apr 24, 2026Updated 3 months ago
ZJU-REAL / InftyThink-Plus
View on GitHub
[ICML 2026] InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
☆34May 25, 2026Updated last month
ZJU-REAL / GUI-G2
View on GitHub
[AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding
☆310Apr 15, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZJU-REAL / HugAgentOS
View on GitHub
HugAgentOS: The Enterprise AgentOS for Ontology-Grounded Trustworthy Reasoning
☆62Updated this week
ZJU-REAL / UI-Zoomer
View on GitHub
☆36Apr 16, 2026Updated 3 months ago
ZJU-REAL / Perceive-to-Reason
View on GitHub
Perceive-to-Reason: Decoupling Perception and Reasoning for Fine-Grained Visual Reasoning
☆31Jul 8, 2026Updated 2 weeks ago
ZJU-REAL / KnowU-Bench
View on GitHub
Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation"
☆76Jun 13, 2026Updated last month
ZJU-REAL / EasySteer
View on GitHub
A Unified Framework for High-Performance and Extensible LLM Steering
☆288Apr 30, 2026Updated 2 months ago
ZJU-OmniAI / GFT
View on GitHub
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification
☆37Jun 10, 2026Updated last month
ZJU-REAL / Awesome-GUI-Agents
View on GitHub
A curated collection of resources, tools, and frameworks for developing GUI Agents.
☆446Jul 9, 2026Updated 2 weeks ago
ZJU-REAL / SkillZero
View on GitHub
Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"
☆355Updated this week
ZhouTimeMachine / note
View on GitHub
Jianjun Zhou's Notebook
☆26Nov 20, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zqtan1024 / sequence-to-set
View on GitHub
☆51Jul 22, 2021Updated 5 years ago
Egbert-Lannister / Robo-Imagine
View on GitHub
Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …
☆31Jul 13, 2025Updated last year
XiPotatonium / pnr
View on GitHub
Accepted at IJCAI-2022
☆11Sep 3, 2022Updated 3 years ago
DAMO-NLP-SG / multimodal_textbook
View on GitHub
[ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"
☆196Mar 17, 2025Updated last year
ZJU-OmniAI / vla-corrector
View on GitHub
a lightweight detect-and-correction inference for vla
☆61Jul 6, 2026Updated 2 weeks ago
GAIR-NLP / InnovatorBench
View on GitHub
[ICLR 2026]InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
☆16Feb 3, 2026Updated 5 months ago
Tongyi-ConvAI / Qwen-Character
View on GitHub
☆50Jun 16, 2026Updated last month
ASTRAL-Group / AlphaOne
View on GitHub
[EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
☆89Jun 10, 2025Updated last year
Qznan / SpanKL
View on GitHub
Code for paper: A Neural Span-Based Continual Named Entity Recognition Model
☆18Dec 11, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
PRIS-CV / FairHuman
View on GitHub
☆73Jul 10, 2025Updated last year
nex-agi / weaver
View on GitHub
Python SDK for Weaver.
☆17Updated this week
ZJU-REAL / ClawGUI
View on GitHub
Build, Evaluate, and Deploy GUI Agents — online RL training, standardized benchmarks, and real-device deployment in one framework.
☆1,319Jun 3, 2026Updated last month
wenquanlu / huginn-latent-cot
View on GitHub
[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…
☆20Oct 4, 2025Updated 9 months ago
sugarandgugu / GaVaMoE
View on GitHub
code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
☆18Dec 7, 2024Updated last year
assafbk / mocha_code
View on GitHub
Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)
☆19Oct 18, 2024Updated last year
zwq2018 / Auto_star
View on GitHub
auto star for repo lists
☆10Aug 26, 2023Updated 2 years ago