Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool works locally and aims to create inference chains akin to those used by OpenAI-o1, but with localized processing power.
☆29Sep 25, 2024Updated last year
Alternatives and similar repositories for Multimodal-Open-O1
Users that are interested in Multimodal-Open-O1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians☆13Dec 5, 2024Updated last year
- [NeurIPS 2023] Official PyTorch implementation for the paper "CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganog…☆11Sep 28, 2023Updated 2 years ago
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆24Apr 13, 2026Updated 3 weeks ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- Official implementation of Geometry Cloak [NeurIPS'24]☆24Apr 16, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.☆74Oct 14, 2024Updated last year
- Official implementation of NeRFProtector [ECCV'24]☆22Aug 27, 2024Updated last year
- 本项目旨在构建一套多场景下可复用的辅助决策型智能 Agent 系统。通过提取用户输入的关键信息,结合历史数据进行智能匹配,系统可在教育路径、法律咨询、金融投资、心理健康、企业经营、供应链优化、危机应对、智能客服等多个领域提供个性化决策建议。系统采用统一的决策流程设计,具备高…☆26Mar 7, 2026Updated 2 months ago
- [CVPR2025] Code Release for "FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting"☆46Jun 20, 2025Updated 10 months ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆33Dec 9, 2025Updated 5 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Apr 24, 2026Updated 2 weeks ago
- (Nature Communications Engineering 2024) Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratio…☆27Mar 9, 2025Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- [ACM MM 24 Best Paper Nomination] ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images☆24Dec 9, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2025] OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking☆49Apr 26, 2025Updated last year
- Dynamic Scene Representation Gaussian Splatting☆39Jun 30, 2025Updated 10 months ago
- Using Groq or OpenAI or Ollama to create o1-like reasoning chains☆290Sep 17, 2024Updated last year
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆240Nov 7, 2025Updated 6 months ago
- Advances in recent large vision language models (LVLMs)☆15Sep 23, 2024Updated last year
- ☆20Apr 24, 2024Updated 2 years ago
- [ICCV 2025 DeepID Challenge] Official 1st Place in both tracks (Detection & Localization)☆18Apr 4, 2026Updated last month
- [NeurIPS 2024] HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting☆45Dec 24, 2024Updated last year
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The official repo for "Unified Domain Adaptive Semantic Segmentation" (IEEE TPAMI 2025)☆34Aug 14, 2025Updated 8 months ago
- PyTorch implementation of “Purified and Unified Steganographic Networks” (CVPR 2024)☆43Jul 14, 2024Updated last year
- [AAAI2025] Revisiting Tampered Scene Text Detection in the Era of Generative AI☆68Apr 27, 2026Updated last week
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆12Oct 27, 2024Updated last year
- Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024)☆19Jul 15, 2024Updated last year
- 包含目标检测前处理与后处理☆20Aug 24, 2021Updated 4 years ago
- ☆192Mar 13, 2026Updated last month
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated last year
- Placeholder☆10Jul 17, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The offical implementation of 'FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant'☆49Nov 22, 2024Updated last year
- Segment Anything (SAM) at Home web app using Gradio☆14Aug 7, 2023Updated 2 years ago
- [TCSVT 2024] D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing☆44Dec 9, 2024Updated last year
- Creating Custom Action Recognition Model using TensorFlow (CNN + LSTM)☆12Feb 22, 2023Updated 3 years ago
- Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City☆12Dec 17, 2023Updated 2 years ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆29Aug 28, 2023Updated 2 years ago