☆37May 28, 2025Updated 9 months ago
Alternatives and similar repositories for MME-VideoOCR
Users that are interested in MME-VideoOCR are comparing it to the libraries listed below
Sorting:
- rmp data ranking☆13Nov 4, 2025Updated 4 months ago
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆20Jun 16, 2025Updated 8 months ago
- ☆22Feb 3, 2026Updated last month
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement☆39Sep 10, 2025Updated 5 months ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆27Dec 18, 2025Updated 2 months ago
- I know Kung Fu☆22Mar 27, 2025Updated 11 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆36Dec 2, 2025Updated 3 months ago
- Official implementation of CVPR 2025 paper "MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes"☆31Feb 24, 2025Updated last year
- adapt data to and from every format☆28Feb 15, 2026Updated 2 weeks ago
- [CVPR 2025] ScaleLSD: Scalable Deep Line Segment Detection Streamlined☆47Sep 25, 2025Updated 5 months ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆28Jun 22, 2024Updated last year
- Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation☆31Mar 28, 2025Updated 11 months ago
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- [CVPR 2025] Open-World Amodal Appearance Completion☆51Nov 10, 2025Updated 3 months ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆40Jan 12, 2026Updated last month
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- High Security Surveillance Camera using OpenCV, Python & Arduino☆12Jun 20, 2020Updated 5 years ago
- ☆11Sep 30, 2024Updated last year
- Script parses Interactive Brokers trade report to aid in Finnish tax report fill☆13Jan 10, 2024Updated 2 years ago
- 🚀 Beautiful React Native UI library☆15Dec 26, 2025Updated 2 months ago
- 非雇员OD管理复盘与面试改进思考☆16Jul 2, 2025Updated 8 months ago
- Statistics and Visualization of acceptance rate, main keyword of CVPR 2023 accepted papers for the main Computer Vision conference (CVPR)☆12May 4, 2023Updated 2 years ago
- A cross-platform ZeroTier desktop client. Build with Tauri, Rust, Vite, React, Zustand, Next UI and Tailwind CSS☆10Oct 24, 2025Updated 4 months ago
- Prompt Free, Soul Driven AI Assistant☆28Feb 19, 2026Updated 2 weeks ago
- A python library that supports all vector databases specifically for LLM apps and frameworks☆13May 3, 2023Updated 2 years ago
- Concurrent TikTok video downloader without watermark. (Snaptik)☆13Dec 16, 2023Updated 2 years ago
- ☆14May 20, 2025Updated 9 months ago
- Object detection and classification☆12Oct 19, 2018Updated 7 years ago
- A cog implementation of mPLUG-Owl🦉, a multimodal large language model☆11May 12, 2023Updated 2 years ago
- TaiYiXLCheckpointLoader: An unoffical node support Taiyi-Diffusion-XL(Taiyi-XL) Chinese-English bilingual language model☆11Sep 1, 2024Updated last year
- ☆13Sep 28, 2024Updated last year
- Machine Learning and Deep Learning with examples.☆10Feb 26, 2019Updated 7 years ago
- Recognize sudoku problem from an image based on OpenCV and Python. 数独图片识别与提取,基于OpenCV和Python☆12Feb 15, 2019Updated 7 years ago
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- Simulation framework for Swarms related application☆11Dec 6, 2022Updated 3 years ago
- A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …☆13Jul 13, 2022Updated 3 years ago
- Some LaTeX Tips for Writing Research Papers☆10May 30, 2016Updated 9 years ago
- [MICCAI 2024 workshop] Official implementation of "SemiT-SAM: Building a Visual Foundation Model for Tooth Instance Segmentation on Panor…☆15Nov 13, 2024Updated last year
- ☆10Dec 12, 2023Updated 2 years ago