AMAP-ML/Tree-GRPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AMAP-ML/Tree-GRPO)

AMAP-ML / Tree-GRPO

[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning

☆386

Alternatives and similar repositories for Tree-GRPO

Users that are interested in Tree-GRPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AMAP-ML / ImagerySearch
View on GitHub
[AAAI2026] ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints
☆56Oct 23, 2025Updated 8 months ago
AMAP-ML / NarrLV
View on GitHub
[ICLR26] NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models
☆112Jul 28, 2025Updated 11 months ago
AMAP-ML / Taming-Hallucinations
View on GitHub
☆55Jun 3, 2026Updated last month
aba122 / Q-Hawkeye
View on GitHub
☆61Feb 9, 2026Updated 5 months ago
AMAP-ML / Eevee
View on GitHub
[CVPR 2026 Findings] Eevee: Towards Close-up High-resolution Video-based Virtual Try-on
☆74Feb 27, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
RainingNovember / LLaTiSA
View on GitHub
This is the official repository of "LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics".
☆78Apr 24, 2026Updated 2 months ago
AMAP-ML / Peak-End-Net
View on GitHub
[ACM MM 2026] Peak-End-Net: A Peak-End Rule Inspired Framework for Generalizable Video Aesthetic Assessment
☆26Updated this week
AMAP-ML / ADE-CoT
View on GitHub
[CVPR 26] From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
☆47Jul 14, 2026Updated last week
AMAP-ML / BlockPilot
View on GitHub
☆62Jun 30, 2026Updated 3 weeks ago
AMAP-ML / MathForge
View on GitHub
[ICLR 2026] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation
☆128May 17, 2026Updated 2 months ago
AMAP-ML / EMF
View on GitHub
[2026 CVPR]Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation
☆109Apr 15, 2026Updated 3 months ago
wangyifei0047 / Pos2Distill-
View on GitHub
[EMNLP25] Official code for "POSITION BIAS MITIGATES POSITION BIAS: Mitigate Position Bias Through Inter-Position Knowledge Distillation…
☆38Nov 11, 2025Updated 8 months ago
AMAP-ML / Omni-Effects
View on GitHub
[AAAI2026] Implementation Code for Omni-Effects
☆175Dec 9, 2025Updated 7 months ago
AMAP-ML / EPG
View on GitHub
[ICLR2026] There is No VAE: End-To-End Pixel-Space Generative Modeling Via Self-Supervised Pre-Training
☆152Mar 27, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AMAP-ML / APPO
View on GitHub
☆71Jun 11, 2026Updated last month
AMAP-ML / MACE-Dance
View on GitHub
[SIGGRAPH 2026] MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
☆105May 19, 2026Updated 2 months ago
Int-SR / DSFNet
View on GitHub
[www2025]DSFNet: Learning Disentangled Scenario Factorization for Multi-Scenario Route Ranking. This paper’s open dataset and implementat…
☆31Sep 9, 2025Updated 10 months ago
AMAP-ML / Omni-WorldBench
View on GitHub
A comprehensive benchmark specifically designed to evaluate the interactive response capabilities of world models in 4D settings.
☆106Mar 24, 2026Updated 3 months ago
AMAP-ML / DCW
View on GitHub
[CVPR 2026] Elucidating the SNR-t Bias of Diffusion Probabilistic Models
☆120Apr 20, 2026Updated 3 months ago
RuiChen96 / FingER
View on GitHub
[ACM MM 25] FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos
☆16Jul 17, 2025Updated last year
AMAP-ML / Thinking-with-Map
View on GitHub
[ACL 2026 Findings] Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization
☆176Mar 9, 2026Updated 4 months ago
AMAP-ML / SpatialGenEval
View on GitHub
[ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
☆132Jan 30, 2026Updated 5 months ago
AMAP-ML / SocioReasoner
View on GitHub
[ICLR26] Official implementation of the paper "Urban Socio-Semantic Segmentation with Vision-Language Reasoning"
☆175Mar 12, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AMAP-ML / S2-Guidance
View on GitHub
[ICLR2026] Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"
☆158May 14, 2026Updated 2 months ago
RUC-NLPIR / ARPO
View on GitHub
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,089Jul 13, 2026Updated last week
AMAP-ML / MobilityBench
View on GitHub
[KDD 2026 Oral] MobilityBench: A Scalable Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
☆156Updated this week
Sugewud / UniMRG
View on GitHub
[ICML 2026] The official implementation of paper "Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation…
☆85May 25, 2026Updated last month
zzfoutofspace / ATPO
View on GitHub
AT2PO: Agentic Turn-based Policy Optimization via Tree Search
☆22May 21, 2026Updated 2 months ago
farisxiong / HS-STaR
View on GitHub
[EMNLP’ 25] Official code for "HS-STaR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation…
☆37Nov 3, 2025Updated 8 months ago
Int-SR / IntRR
View on GitHub
IntRR:A Framework for Integrating SID Redistribution and Length Reduction
☆39Feb 27, 2026Updated 4 months ago
AMAP-ML / VMBench
View on GitHub
[ICCV 25] VMBench: A Benchmark for Perception-Aligned Video Motion Generation
☆76Oct 10, 2025Updated 9 months ago
THUDM / TreeRL
View on GitHub
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25
☆97Jun 16, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AMAP-ML / Video-STAR
View on GitHub
[ICLR2026] Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools
☆205Apr 17, 2026Updated 3 months ago
multimodal-art-projection / TreePO
View on GitHub
☆65Mar 30, 2026Updated 3 months ago
Alibaba-Quark / SSP
View on GitHub
Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
☆103Mar 4, 2026Updated 4 months ago
Int-SR / IntTravel
View on GitHub
IntTravel: A Real-World Dataset and Generative Framework for Integrated Multi-Task Travel Recommendation
☆58Feb 18, 2026Updated 5 months ago
wangyifei0047 / FASA-ICLR2026
View on GitHub
[ICLR 2026] FASA: FREQUENCY-AWARE SPARSE ATTENTION
☆20Mar 1, 2026Updated 4 months ago
AMAP-ML / UniVG-R1
View on GitHub
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
☆165Jun 2, 2025Updated last year
AMAP-ML / GPG
View on GitHub
[ICLR26]GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
☆179Jan 29, 2026Updated 5 months ago