STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.
☆2,116 · Mar 14, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for gelab-zero
Users that are interested in gelab-zero are comparing it to the libraries listed below.
- Fast, Sharp & Reliable Agentic Intelligence ☆1,980 · Updated this week
- MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B ☆1,765 · Mar 20, 2026 · Updated 2 weeks ago
- A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular… ☆535 · Updated this week
- Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning ☆72 · Dec 30, 2025 · Updated 3 months ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning ☆337 · Feb 5, 2026 · Updated 2 months ago
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation… ☆1,382 · Mar 16, 2026 · Updated 3 weeks ago
- Step-DeepResearch ☆537 · Mar 24, 2026 · Updated 2 weeks ago
- Mobile-Agent: The Powerful GUI Agent Family ☆8,408 · Mar 31, 2026 · Updated last week
- ☆20 · Jan 22, 2026 · Updated 2 months ago
- Align Anything: Training All-modality Model with Feedback ☆4,643 · Nov 27, 2025 · Updated 4 months ago
- The LLM abstraction layer for modern AI agent applications. ☆512 · Apr 2, 2026 · Updated last week
- Official code for paper "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable R… ☆50 · Mar 29, 2026 · Updated last week
- ☆454 · Aug 10, 2025 · Updated 7 months ago
- Pioneering Automated GUI Interaction with Native Agents ☆10,024 · Jan 27, 2026 · Updated 2 months ago
- A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics… ☆893 · Mar 16, 2026 · Updated 3 weeks ago
- UFO³: Weaving the Digital Agent Galaxy ☆8,378 · Updated this week
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents. ☆1,158 · Aug 17, 2025 · Updated 7 months ago
- GraphiContact is a robust method for 3D human reconstruction and contact point prediction from monocular RGB images, utilizing pose-aware… ☆51 · Mar 24, 2026 · Updated 2 weeks ago
- Official Repository of Native Parallel Reasoner ☆105 · Feb 5, 2026 · Updated 2 months ago
- The next-generation deep reinforcement learning toolkit ☆3,462 · Jun 16, 2023 · Updated 2 years ago
- Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale ☆5,699 · Updated this week
- Large language models designed for formal theorem proving through tool-integrated reasoning. ☆33 · Aug 13, 2025 · Updated 7 months ago
- AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient… ☆1,348 · Jan 11, 2026 · Updated 2 months ago
- Nextjs RCE Exploit Kit ☆148 · Feb 13, 2026 · Updated last month
- [🚀 ICLR 2026 Oral] NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s … ☆655 · Feb 27, 2026 · Updated last month
- Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mo… ☆7,922 · Feb 26, 2026 · Updated last month
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation ☆8,643 · Sep 14, 2024 · Updated last year
- Digital Base (数字底座) is a unified, secure management support platform for the digital transformation of large governments and enterprises, built on identity authentication, organizational structure, positions and duties, application systems, resource roles, data catalogs, security controls, and other functions. Based on the three-administrator management model, it offers microservices, multi-tenancy, containerization, and support for domestic technology stacks, lets users quickly build their own business applications with a code generator, and can also link to various… ☆2,580 · Apr 1, 2026 · Updated last week
- 💰 The only genuine version 💰 minerproxy: mining pool fee skimming, mining pool proxy, mining pool relay, mining pool… ☆3,866 · Mar 22, 2026 · Updated 2 weeks ago
- The first open autoregressive foundational video AI model. ☆2,892 · Oct 14, 2024 · Updated last year
- PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation. ☆3,657 · Jan 26, 2026 · Updated 2 months ago
- [ICLR 2026 Oral] ScaleCUA provides open-source computer use agents that can operate in cross-platform environments (Windows, macOS, Ubuntu… ☆1,098 · Jan 7, 2026 · Updated 3 months ago
- slime is an LLM post-training framework for RL Scaling. ☆5,139 · Updated this week
- WebResearcher: An Iterative Deep-Research Agent ☆48 · Feb 13, 2026 · Updated last month
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI… ☆11,289 · Updated this week
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning. ☆3,167 · Dec 15, 2025 · Updated 3 months ago
- Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its… ☆402 · Jan 21, 2026 · Updated 2 months ago
- 🔥 minerproxy: mining pool fee skimming, mining pool relay, built for mining farm operations ☆3,412 · Apr 2, 2026 · Updated last week
- Accelerate LLM preference tuning via prefix sharing with a single line of code ☆51 · Jul 4, 2025 · Updated 9 months ago