STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.
☆2,206May 11, 2026Updated last month
Alternatives and similar repositories for gelab-zero
Users that are interested in gelab-zero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B☆1,821Apr 20, 2026Updated 2 months ago
- A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular…☆575May 18, 2026Updated last month
- Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning☆76Apr 20, 2026Updated 2 months ago
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…☆1,466Mar 16, 2026Updated 3 months ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆336Feb 5, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Mobile-Agent: The Powerful GUI Agent Family☆8,864May 14, 2026Updated last month
- Step-DeepResearch☆566Mar 24, 2026Updated 3 months ago
- ☆20Jan 22, 2026Updated 5 months ago
- Align Anything: Training All-modality Model with Feedback☆4,661Nov 27, 2025Updated 7 months ago
- The LLM abstraction layer for modern AI agent applications.☆517Jun 22, 2026Updated last week
- ☆42May 9, 2026Updated last month
- ☆453Aug 10, 2025Updated 10 months ago
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…☆938Apr 9, 2026Updated 2 months ago
- Pioneering Automated GUI Interaction with Native Agents☆11,053Jan 27, 2026Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- UFO³: Weaving the Digital Agent Galaxy☆9,098Jun 22, 2026Updated last week
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.☆1,194Aug 17, 2025Updated 10 months ago
- [🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …☆690Feb 27, 2026Updated 4 months ago
- [ICML 2026] Reasoning in Parallelism via Self-Distilled RL☆114Updated this week
- The next generation deep reinforcement learning tookit☆3,463Jun 16, 2023Updated 3 years ago
- Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale☆5,763Jun 1, 2026Updated 3 weeks ago
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆34Aug 13, 2025Updated 10 months ago
- Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its…☆409Jan 21, 2026Updated 5 months ago
- AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient…☆1,379Jan 11, 2026Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Nextjs RCE Exploit Kit☆148Jun 18, 2026Updated last week
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,651Sep 14, 2024Updated last year
- Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mo…☆8,109Updated this week
- 数字底座是一款面向大型政府、企业数字化转型,基于身份认证、组织架构、岗位职务、应用系统、资源角色、数据目录、安全控制等功能构建的统一且安全的管理支撑平台。数字底座基于三员管理模式,具备微服务、多租户、容器化和国产化,支持用户利用代码生成器快速构建自己的业务应用,同时可关联诸…☆2,596Updated this week
- 💰唯一正版💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy 矿池抽水 矿池代理 矿池中转 矿池抽…☆3,873Jun 22, 2026Updated last week
- The first open autoregressive foundational video AI model.☆2,892Oct 14, 2024Updated last year
- slime is an LLM post-training framework for RL Scaling.☆7,099Updated this week
- [CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,714Jun 10, 2026Updated 2 weeks ago
- [ICLR 2026 Oral] ScaleCUA is the open-sourced computer use agents that can operate on cross-platform environments (Windows, macOS, Ubuntu…☆1,120Jan 7, 2026Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI…☆11,473Updated this week
- 🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用☆3,698May 22, 2026Updated last month
- ☆37Mar 7, 2025Updated last year
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.☆3,159Dec 15, 2025Updated 6 months ago
- WebResearcher: An Iterative Deep-Research Agent,迭代式深度研究智能体☆48Feb 13, 2026Updated 4 months ago
- Accelerate LLM preference tuning via prefix sharing with a single line of code☆52Jul 4, 2025Updated 11 months ago
- The open source platform for AI-native application development.☆5,382Dec 2, 2024Updated last year