FrankYang-17 / MME-VideoOCRView external linksLinks
☆37May 28, 2025Updated 8 months ago
Alternatives and similar repositories for MME-VideoOCR
Users that are interested in MME-VideoOCR are comparing it to the libraries listed below
Sorting:
- rmp data ranking☆13Nov 4, 2025Updated 3 months ago
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆20Jun 16, 2025Updated 7 months ago
- ☆21Feb 3, 2026Updated last week
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement☆39Sep 10, 2025Updated 5 months ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆26Dec 18, 2025Updated last month
- I know Kung Fu☆21Mar 27, 2025Updated 10 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆33Dec 2, 2025Updated 2 months ago
- Official implementation of CVPR 2025 paper "MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes"☆31Feb 24, 2025Updated 11 months ago
- adapt data to and from every format☆28Oct 15, 2025Updated 4 months ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆28Jun 22, 2024Updated last year
- [CVPR 2025] ScaleLSD: Scalable Deep Line Segment Detection Streamlined☆44Sep 25, 2025Updated 4 months ago
- Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation☆31Mar 28, 2025Updated 10 months ago
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- [CVPR 2025] Open-World Amodal Appearance Completion☆50Nov 10, 2025Updated 3 months ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆40Jan 12, 2026Updated last month
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- High Security Surveillance Camera using OpenCV, Python & Arduino☆12Jun 20, 2020Updated 5 years ago
- Script parses Interactive Brokers trade report to aid in Finnish tax report fill☆13Jan 10, 2024Updated 2 years ago
- ☆11Sep 30, 2024Updated last year
- Prompt Free, Soul Driven AI Assistant☆29Feb 8, 2026Updated last week
- Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrati…☆16Feb 4, 2026Updated last week
- ☆32Nov 18, 2025Updated 2 months ago
- ☆13Sep 28, 2024Updated last year
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- Machine Learning and Deep Learning with examples.☆10Feb 26, 2019Updated 6 years ago
- 🚀 Beautiful React Native UI library☆15Dec 26, 2025Updated last month
- Recognize sudoku problem from an image based on OpenCV and Python. 数独图片识别与提取,基于OpenCV和Python☆12Feb 15, 2019Updated 7 years ago
- TaiYiXLCheckpointLoader: An unoffical node support Taiyi-Diffusion-XL(Taiyi-XL) Chinese-English bilingual language model☆11Sep 1, 2024Updated last year
- A cog implementation of mPLUG-Owl🦉, a multimodal large language model☆11May 12, 2023Updated 2 years ago
- A python library that supports all vector databases specifically for LLM apps and frameworks☆13May 3, 2023Updated 2 years ago
- Statistics and Visualization of acceptance rate, main keyword of CVPR 2023 accepted papers for the main Computer Vision conference (CVPR)☆12May 4, 2023Updated 2 years ago
- Simulation framework for Swarms related application☆11Dec 6, 2022Updated 3 years ago
- A cross-platform ZeroTier desktop client. Build with Tauri, Rust, Vite, React, Zustand, Next UI and Tailwind CSS☆10Oct 24, 2025Updated 3 months ago
- 非雇员OD管理复盘与面试改进思考☆16Jul 2, 2025Updated 7 months ago
- Object detection and classification☆12Oct 19, 2018Updated 7 years ago
- Concurrent TikTok video downloader without watermark. (Snaptik)☆13Dec 16, 2023Updated 2 years ago
- ☆14May 20, 2025Updated 8 months ago
- ☆10Dec 12, 2023Updated 2 years ago
- a graph definition and execution library for python☆16Mar 22, 2023Updated 2 years ago