ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands
☆97Feb 6, 2026Updated last month
Alternatives and similar repositories for showui-pi
Users that are interested in showui-pi are comparing it to the libraries listed below
Sorting:
- ☆11Dec 11, 2023Updated 2 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated 3 weeks ago
- ☆13Sep 2, 2023Updated 2 years ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆75Aug 2, 2025Updated 7 months ago
- https://avocado-captioner.github.io/☆30Oct 16, 2025Updated 4 months ago
- [CVPR 2026] FrankenMotion: Part-level Human Motion Generation and Composition☆198Feb 27, 2026Updated last week
- [AAAI 2026] UltraGen☆77Feb 1, 2026Updated last month
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation☆20Feb 2, 2024Updated 2 years ago
- FQGAN: Factorized Visual Tokenization and Generation☆59Mar 29, 2025Updated 11 months ago
- ☆86Feb 4, 2026Updated last month
- Official PyTorch Implementation of Ctrl-Crash 💥☆51Jun 3, 2025Updated 9 months ago
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆111Apr 16, 2025Updated 10 months ago
- Video Reasoning Segmentation☆28Nov 29, 2024Updated last year
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks☆136Oct 20, 2025Updated 4 months ago
- Code of WinT3R: Window-Based Streaming Rrconstruction With Camera Token Pool☆222Updated this week
- ☆370Jul 25, 2025Updated 7 months ago
- Code for 'JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion'☆213Feb 10, 2026Updated 3 weeks ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆69Feb 26, 2026Updated last week
- Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"☆65Dec 17, 2025Updated 2 months ago
- Animate Any Character in Any World☆90Jan 9, 2026Updated last month
- Autonomous Rescue Robot☆17Nov 16, 2022Updated 3 years ago
- PainterVRAM lets you reserve a slice of GPU memory before ComfyUI starts processing, preventing out-of-memory crashes. Switch between man…☆27Jan 2, 2026Updated 2 months ago
- A fork of BlenderProc used in the GRADE framework to generate environments and export some additional information for processing.☆10Mar 9, 2023Updated 2 years ago
- ToonOut, a fork of BiRefNet focused on background removal for anime images. We open-source our dataset & our weights. See our paper at: h…☆82Sep 10, 2025Updated 5 months ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 3 months ago
- [CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation☆45May 7, 2024Updated last year
- Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23)☆45Oct 19, 2023Updated 2 years ago
- The official generation code and toolkits of VDW dataset (ICCV 2023)☆35Jul 6, 2024Updated last year
- One-shot and Few-shot 3D Editing without Per-Scene Optimization☆164Aug 21, 2025Updated 6 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".☆84Jul 10, 2025Updated 7 months ago
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆99Jan 1, 2026Updated 2 months ago