stepfun-ai/gelab-zero

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stepfun-ai/gelab-zero)

stepfun-ai / gelab-zero

STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

☆2,233

Alternatives and similar repositories for gelab-zero

Users that are interested in gelab-zero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stepfun-ai / PaCoRe
View on GitHub
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
☆336Feb 5, 2026Updated 5 months ago
stepfun-ai / Step-3.5-Flash
View on GitHub
Fast, Sharp & Reliable Agentic Intelligence
☆2,092Apr 3, 2026Updated 3 months ago
Tongyi-MAI / MAI-UI
View on GitHub
MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B
☆1,823Apr 20, 2026Updated 3 months ago
stepfun-ai / StepDeepResearch
View on GitHub
Step-DeepResearch
☆569Mar 24, 2026Updated 3 months ago
stepfun-ai / Step-Audio-R1
View on GitHub
☆688Apr 29, 2026Updated 2 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
stepfun-ai / Step3-VL-10B
View on GitHub
Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its…
☆407Jan 21, 2026Updated 5 months ago
summonerloong / gelab-engine
View on GitHub
☆21Dec 3, 2025Updated 7 months ago
X-PLUG / MobileAgent
View on GitHub
Mobile-Agent: The Powerful GUI Agent Family
☆8,957Jul 7, 2026Updated last week
stepfun-ai / Step-Audio2
View on GitHub
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…
☆1,484Mar 16, 2026Updated 4 months ago
stepfun-ai / SteptronOss
View on GitHub
A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular…
☆576May 18, 2026Updated 2 months ago
Tongyi-MAI / MobileWorld
View on GitHub
Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments (ACL 2026)
☆240Jul 2, 2026Updated 2 weeks ago
stepfun-ai / Step-Audio-EditX
View on GitHub
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…
☆951Apr 9, 2026Updated 3 months ago
zai-org / Open-AutoGLM
View on GitHub
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
☆25,827Mar 6, 2026Updated 4 months ago
OpenBMB / AgentCPM-GUI
View on GitHub
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient…
☆1,392Jan 11, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
google-research / android_world
View on GitHub
AndroidWorld is an environment and benchmark for autonomous agents
☆826Updated this week
stepfun-ai / Step3
View on GitHub
☆453Aug 10, 2025Updated 11 months ago
bytedance / UI-TARS
View on GitHub
Pioneering Automated GUI Interaction with Native Agents
☆11,202Jan 27, 2026Updated 5 months ago
stepfun-ai / NextStep-1
View on GitHub
[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …
☆689Feb 27, 2026Updated 4 months ago
IPADS-SAI / MobiAgent
View on GitHub
The Intelligent GUI Agent for Mobile Phones
☆1,862Updated this week
aiming-lab / Agent0
View on GitHub
[COLM'26 & ICML'26] Agent0 Series: Self-Evolving Agents from Zero Data
☆1,233Jul 10, 2026Updated last week
bytedance / UI-TARS-desktop
View on GitHub
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
☆38,143Jul 1, 2026Updated 2 weeks ago
VectorSpaceLab / general-agentic-memory
View on GitHub
A general memory system for agents, powered by deep-research
☆857Mar 14, 2026Updated 4 months ago
inclusionAI / UI-Venus
View on GitHub
UI-Venus is a native UI agent designed to perform precise GUI element grounding and effective navigation using only screenshots as input.
☆1,014May 11, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
stepfun-ai / Step-3.7-Flash
View on GitHub
A high-efficiency Flash model for real-world agents.
☆281Jun 1, 2026Updated last month
microsoft / fara
View on GitHub
Fara1.5 – A family of frontier computer use agent models
☆6,014Updated this week
droidrun / mobilerun
View on GitHub
Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖
☆8,829Updated this week
Alibaba-NLP / DeepResearch
View on GitHub
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,691Feb 27, 2026Updated 4 months ago
dabit3 / fabricate
View on GitHub
An experimental research tool for fabricating GitHub personas with AI-generated repositories
☆288Dec 23, 2025Updated 6 months ago
showlab / Awesome-GUI-Agent
View on GitHub
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,197Aug 17, 2025Updated 11 months ago
xlang-ai / OSWorld-G
View on GitHub
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172Jun 18, 2026Updated last month
microsoft / GUI-Actor
View on GitHub
[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
☆410Apr 13, 2026Updated 3 months ago
qualcomm / GenieX
View on GitHub
Run frontier LLMs and VLMs locally on Qualcomm devices across NPU, GPU, and CPU with a few lines of code
☆8,233Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
microsoft / magentic-ui
View on GitHub
MagenticLite is an experimental agent that works across the browser and local file system
☆9,965Updated this week
zai-org / GLM-V
View on GitHub
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
☆2,356Updated this week
MadeAgents / mobile-use
View on GitHub
MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and…
☆167Jul 10, 2026Updated last week
memodb-io / Acontext
View on GitHub
Agent Skills as a Memory Layer
☆3,583Updated this week
xlang-ai / OpenCUA
View on GitHub
[NeurIPS 2025 Spotlight] OpenCUA: Open Foundations for Computer-Use Agents
☆801May 25, 2026Updated last month
ZJU-REAL / Awesome-GUI-Agents
View on GitHub
A curated collection of resources, tools, and frameworks for developing GUI Agents.
☆444Jul 9, 2026Updated last week
xlang-ai / OSWorld
View on GitHub
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
☆3,026Updated this week