xmu-xiaoma666 / Multimodal-Open-O1View external linksLinks
Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool works locally and aims to create inference chains akin to those used by OpenAI-o1, but with localized processing power.
☆29Sep 25, 2024Updated last year
Alternatives and similar repositories for Multimodal-Open-O1
Users that are interested in Multimodal-Open-O1 are comparing it to the libraries listed below
Sorting:
- 本项目旨在构建一套多场景下可复用的辅助决策型智能 Agent 系统。通过提取用户输入的关键信息,结合历史数据进行智能匹配,系统可在教育路径、法律咨询、金融投资、心理健康、企业经营、供应链优化、危机应对、智能客服等多个领域提供个性化决策建议。系统采用统一的决策流程设计,具备高…☆21Jul 22, 2025Updated 6 months ago
- ☆21Dec 10, 2025Updated 2 months ago
- [NeurIPS 2023] Official PyTorch implementation for the paper "CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganog…☆11Sep 28, 2023Updated 2 years ago
- RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians☆14Dec 5, 2024Updated last year
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆16Dec 9, 2025Updated 2 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Feb 9, 2026Updated last week
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated 10 months ago
- Official implementation of Geometry Cloak [NeurIPS'24]☆24Apr 16, 2025Updated 10 months ago
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.☆74Oct 14, 2024Updated last year
- 包含目标检测前处理与后处理☆20Aug 24, 2021Updated 4 years ago
- Code for paper OmniSSR☆23Apr 21, 2025Updated 9 months ago
- Official implementation of NeRFProtector [ECCV'24]☆22Aug 27, 2024Updated last year
- ☆72Updated this week
- The official repo for "Unified Domain Adaptive Semantic Segmentation" (IEEE TPAMI 2025)☆33Aug 14, 2025Updated 6 months ago
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆52Dec 30, 2023Updated 2 years ago
- (Nature Communications Engineering 2024) Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratio…☆26Mar 9, 2025Updated 11 months ago
- 小智的视觉对话☆32Apr 25, 2025Updated 9 months ago
- support BM25+vecetor☆29May 26, 2025Updated 8 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- ☆189Feb 5, 2026Updated last week
- Dynamic Scene Representation Gaussian Splatting☆37Jun 30, 2025Updated 7 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- [CVPR 2025] OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking☆46Apr 26, 2025Updated 9 months ago
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆29Aug 28, 2023Updated 2 years ago
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆233Nov 7, 2025Updated 3 months ago
- 基于paddlex目标检测的工业场景下违规使用手机识别。☆11Jun 11, 2022Updated 3 years ago
- ☆12Sep 19, 2022Updated 3 years ago
- Code for running experiments and benchmarking on GNNExplainer: Generating Explanations for Graph Neural Networks☆15May 8, 2021Updated 4 years ago
- ☆11May 16, 2025Updated 9 months ago
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- A django-yolov5 starter webapp. Based on yolov5-flask example.☆11Mar 6, 2022Updated 3 years ago
- ☆22Dec 11, 2025Updated 2 months ago
- 本项目基于RuoYi-Vue框架为xiaozhi-esp32提供Java后端聊天服务器。帮助个人、企业快速部署的xiaozhi-esp32后端服务。☆21Jun 19, 2025Updated 7 months ago
- ROS1 implementation of Foundation Pose for model-based 6DoF pose tracking☆18Feb 7, 2025Updated last year
- 一个用YOLO足球视频分析的任务,检测视频中的人与球。 A task of football video analysis to detect people and balls in the video with YOLO☆12Sep 5, 2020Updated 5 years ago
- Getting started with MIMIC-III Critical Care Database☆12Mar 3, 2019Updated 6 years ago
- Research sources on graph-based anomaly detection☆13Nov 29, 2022Updated 3 years ago