Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device
☆75Updated this week
Alternatives and similar repositories for Mobile-O
Users that are interested in Mobile-O are comparing it to the libraries listed below
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆54Feb 11, 2026Updated 2 weeks ago
- sora2 free watermark remover☆767Feb 20, 2026Updated last week
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- ☆14Feb 13, 2026Updated 2 weeks ago
- Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion☆12Jan 14, 2026Updated last month
- Protocol buffers and other common resources.☆13Jan 20, 2026Updated last month
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆12Dec 31, 2024Updated last year
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆32Nov 11, 2025Updated 3 months ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- PyTorch implementation of the paper "Region-Aware Portrait Retouching with Sparse Interactive Guidance" published in IEEE Transactions o…☆15Jun 14, 2023Updated 2 years ago
- Use MCL in Mirai Console to manage packages and other advanced features☆10Nov 13, 2022Updated 3 years ago
- Local 4B codebase explorer agent distilled from Qwen3-Coder-Next.☆70Updated this week
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆22Feb 15, 2026Updated last week
- Implementation of 'FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention'☆11Mar 27, 2024Updated last year
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated 2 weeks ago
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆13Jul 12, 2025Updated 7 months ago
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year
- [ICML 2025] Efficiently Serving Large Multimodal Models Using EPD Disaggregation☆22May 29, 2025Updated 8 months ago
- Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework☆19Sep 8, 2025Updated 5 months ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆15Updated this week
- ☆35Feb 12, 2026Updated 2 weeks ago
- ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/vide…☆20May 5, 2024Updated last year
- Minute-long video generation at 24FPS.☆50Feb 2, 2026Updated 3 weeks ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆33Feb 4, 2026Updated 3 weeks ago
- clustering algorithm implementation☆13Nov 3, 2025Updated 3 months ago
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- [CVPR 2024] No More Ambiguity in 360° Room Layout via Bi-Layout Estimation☆17Oct 9, 2024Updated last year
- A no-dependency utility to undervolt Intel CPUs on Linux systems, with user-friendly GUI☆16Apr 19, 2025Updated 10 months ago
- A statistical framework for graph anomaly detection.☆17Sep 23, 2018Updated 7 years ago
- Empowering everyone to create reliable and safe AI coding agents.☆12Sep 2, 2024Updated last year
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆41Feb 10, 2026Updated 2 weeks ago
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated 9 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- This project demonstrates a real-time delivery location tracking system similar to Zomato/Swiggy, built using Spring Boot and Apache Kafk…☆28Dec 4, 2025Updated 2 months ago
- 🏆1st place in the PANORAMA challenge (early detection of PDAC on contrast-enhanced CT)☆15Jan 13, 2026Updated last month
- ☆44Feb 12, 2026Updated 2 weeks ago
- An ecosystem of Rust libraries for working with large language models☆14Oct 2, 2023Updated 2 years ago
- The official repo for the DanQing dataset.☆29Jan 16, 2026Updated last month