STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.
☆2,147Apr 29, 2026Updated this week
Alternatives and similar repositories for gelab-zero
Users that are interested in gelab-zero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast, Sharp & Reliable Agentic Intelligence☆2,012Apr 3, 2026Updated 3 weeks ago
- MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B☆1,788Apr 20, 2026Updated last week
- A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular…☆559Updated this week
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…☆1,399Mar 16, 2026Updated last month
- Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning☆74Apr 20, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆336Feb 5, 2026Updated 2 months ago
- Step-DeepResearch☆550Mar 24, 2026Updated last month
- Mobile-Agent: The Powerful GUI Agent Family☆8,578Apr 14, 2026Updated 2 weeks ago
- ☆21Jan 22, 2026Updated 3 months ago
- Align Anything: Training All-modality Model with Feedback☆4,649Nov 27, 2025Updated 5 months ago
- The LLM abstraction layer for modern AI agent applications.☆521Updated this week
- ☆453Aug 10, 2025Updated 8 months ago
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…☆908Apr 9, 2026Updated 3 weeks ago
- Pioneering Automated GUI Interaction with Native Agents☆10,132Jan 27, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official code for paper "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable R…☆59Mar 29, 2026Updated last month
- UFO³: Weaving the Digital Agent Galaxy☆8,508Apr 14, 2026Updated 2 weeks ago
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.☆1,172Aug 17, 2025Updated 8 months ago
- [🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …☆679Feb 27, 2026Updated 2 months ago
- Official Repository of Native Parallel Reasoner☆107Feb 5, 2026Updated 2 months ago
- The next generation deep reinforcement learning tookit☆3,464Jun 16, 2023Updated 2 years ago
- Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale☆5,718Updated this week
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆34Aug 13, 2025Updated 8 months ago
- Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its…☆403Jan 21, 2026Updated 3 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient…☆1,359Jan 11, 2026Updated 3 months ago
- Nextjs RCE Exploit Kit☆148Feb 13, 2026Updated 2 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,648Sep 14, 2024Updated last year
- Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mo…☆7,968Apr 14, 2026Updated 2 weeks ago
- 数字底座是一款面向大型政府、企业数字化转型,基于身份认证、组织架构、岗位职务、应用系统、资源角色、数据目录、安全控制等功能构建的统一且安全的管理支撑平台。数字底座基于三员 管理模式,具备微服务、多租户、容器化和国产化,支持用户利用代码生成器快速构建自己的业务应用,同时可关联诸…☆2,584Updated this week
- 💰唯一正版💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy 矿池抽水 矿池代理 矿池中转 矿池抽…☆3,867Mar 22, 2026Updated last month
- The first open autoregressive foundational video AI model.☆2,891Oct 14, 2024Updated last year
- [ICLR 2026 Oral] ScaleCUA is the open-sourced computer use agents that can operate on cross-platform environments (Windows, macOS, Ubuntu…☆1,106Jan 7, 2026Updated 3 months ago
- [CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,669Jan 26, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI…☆11,336Updated this week
- slime is an LLM post-training framework for RL Scaling.☆5,490Updated this week
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.☆3,162Dec 15, 2025Updated 4 months ago
- 🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用☆3,463Apr 9, 2026Updated 2 weeks ago
- Accelerate LLM preference tuning via prefix sharing with a single line of code☆51Jul 4, 2025Updated 9 months ago
- The open source platform for AI-native application development.☆5,379Dec 2, 2024Updated last year
- [ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation☆3,695Feb 27, 2025Updated last year