Alibaba-Quark/SSP

Search Self-Play: Pushing the Frontier of Agent Capability without Supervision

☆103

Alternatives and similar repositories for SSP

Users who are interested in SSP are comparing it to the repositories listed below.

Qwen-Applications / DIR
☆17 · Feb 14, 2026 · Updated 5 months ago
Qwen-Applications / MARCH
☆28 · Jun 9, 2026 · Updated last month
Qwen-Applications / GD2PO
☆20 · Jun 16, 2026 · Updated last month
Qwen-Applications / CLIPO
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR
☆21 · Apr 7, 2026 · Updated 3 months ago
Qwen-Applications / SSP
Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
☆20 · Dec 30, 2025 · Updated 6 months ago
DeepExperience / agent2world
🪐 Agent2World: Learning to Generate Symbolic World Models via Adaptive Multi-Agent Feedback
☆23 · Jan 29, 2026 · Updated 5 months ago
YuyaoZhangQAQ / QCompiler
This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.
☆17 · Oct 20, 2025 · Updated 9 months ago
DeepExperience / REAL
Rewards as Labels: Revisiting RLVR from a Classification Perspective
☆24 · Jun 26, 2026 · Updated last month
AMAP-ML / Tree-GRPO
[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning
☆387 · Jan 26, 2026 · Updated 6 months ago
THUDM / AgentRL
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
☆324 · Jan 17, 2026 · Updated 6 months ago
smallporridge / TrustworthyRAG
☆16 · May 18, 2026 · Updated 2 months ago
AMAP-ML / GPG
[ICLR26] GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
☆179 · Jan 29, 2026 · Updated 5 months ago
RUC-NLPIR / ARPO
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,092 · Jul 13, 2026 · Updated last week
DeepExperience / HyperEyes
HyperEyes is a parallel multimodal search agent that fuses visual grounding and retrieval into a single atomic action, enabling concurren…
☆70 · May 23, 2026 · Updated 2 months ago
Linear95 / DSP
Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25 · Mar 7, 2024 · Updated 2 years ago
callsys / GMPO
[ICLR 2026] Geometric-Mean Policy Optimization
☆104 · Jan 26, 2026 · Updated 6 months ago
zwhe99 / LLM-MT-Eval
{DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}
☆14 · Jun 18, 2023 · Updated 3 years ago
inclusionAI / ASearcher
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
☆602 · Nov 26, 2025 · Updated 8 months ago
Jiahao004 / DeepTheorem
☆27 · Jun 10, 2025 · Updated last year
Mizersy / RepoDeepSearch
☆44 · Oct 28, 2025 · Updated 8 months ago
LgQu / TIGeR
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16 · Jul 19, 2026 · Updated last week
ventr1c / Awesome-RL-based-Agentic-Search-Papers
The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Eva…
☆279 · Updated this week
chanchimin / AgentMonitor
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13 · Dec 13, 2024 · Updated last year
Qwen-Applications / Trace2Skill
Official codebase of the paper -- Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
☆207 · May 1, 2026 · Updated 2 months ago
OPPO-PersonalAI / Flash-Searcher
Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution
☆88 · Dec 8, 2025 · Updated 7 months ago
DingWu1021 / Promsa
[ECCV 2026] Promsa: Progressive Multimodal Search Agents for Knowledge-Based Visual Question Answering
☆91 · Jul 7, 2026 · Updated 2 weeks ago
RUC-NLPIR / SmartSearch
☆46 · Jan 19, 2026 · Updated 6 months ago
dengmengjie / ToolScope
Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
☆31 · Nov 4, 2025 · Updated 8 months ago
RUC-NLPIR / OmniGAIA
OmniGAIA: Towards Native Omni-Modal AI Agents
☆138 · Apr 2, 2026 · Updated 3 months ago
XueruiSu / Trust-Region-Preference-Approximation
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15 · Jun 28, 2025 · Updated last year
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,153 · Nov 13, 2025 · Updated 8 months ago
wangyifei0047 / FASA-ICLR2026
[ICLR 2026] FASA: FREQUENCY-AWARE SPARSE ATTENTION
☆20 · Mar 1, 2026 · Updated 4 months ago
wangyifei0047 / Pos2Distill-
[EMNLP25] Official code for "POSITION BIAS MITIGATES POSITION BIAS: Mitigate Position Bias Through Inter-Position Knowledge Distillation…
☆38 · Nov 11, 2025 · Updated 8 months ago
SLIT-AI / WRPO
[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆14 · Mar 17, 2025 · Updated last year
EvolvingLMMs-Lab / multimodal-search-r1
[ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…
☆470 · Apr 7, 2026 · Updated 3 months ago
mzf666 / MATPO
Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.
☆82 · Oct 31, 2025 · Updated 8 months ago
facebookresearch / drzero
Dr. Zero Self-Evolving Search Agents without Training Data
☆525 · Mar 23, 2026 · Updated 4 months ago
AgentDS / Awesome-Mess
An awesome & curated list of anything that might be useful for computer science students
☆13 · Mar 27, 2023 · Updated 3 years ago
SLIT-AI / FuseChat-3.0
☆18 · Apr 18, 2025 · Updated last year