☆38May 28, 2025Updated 10 months ago
Alternatives and similar repositories for MME-VideoOCR
Users that are interested in MME-VideoOCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Feb 3, 2026Updated last month
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 11 months ago
- Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation☆31Mar 28, 2025Updated last year
- rmp data ranking☆13Nov 4, 2025Updated 4 months ago
- Statistics and Visualization of acceptance rate, main keyword of CVPR 2023 accepted papers for the main Computer Vision conference (CVPR)☆12May 4, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Simulation framework for Swarms related application☆11Dec 6, 2022Updated 3 years ago
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆12May 2, 2024Updated last year
- 📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, AAAI, IJCAI, ICML, AAMAS, ICLR, ICRA, etc. | (AI…☆11Aug 20, 2023Updated 2 years ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆29Nov 24, 2025Updated 4 months ago
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆31Oct 2, 2025Updated 5 months ago
- ☆11Mar 12, 2025Updated last year
- [CVPR 2026]☆44Updated this week
- Variationist: Exploring Multifaceted Variation and Bias in Written Language Data (ACL 2024 demo track)☆10Jan 31, 2026Updated last month
- Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization☆22Dec 13, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- The Next Step Forward in Multimodal LLM Alignment☆199May 1, 2025Updated 10 months ago
- Implementation of the paper "Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in Mixed Coo…☆17Dec 7, 2024Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆29Dec 18, 2025Updated 3 months ago
- I know Kung Fu☆24Mar 27, 2025Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆39Dec 2, 2025Updated 3 months ago
- Implementation of the paper "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning"☆21Aug 17, 2024Updated last year
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement☆39Sep 10, 2025Updated 6 months ago
- Search Engine Guided Non-Parametric Neural Machine Translation☆14Oct 23, 2017Updated 8 years ago
- ☆38Mar 30, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated 11 months ago
- [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".☆23Jun 16, 2025Updated 9 months ago
- A LaTex paper template for security and machine learning conferences☆24Jan 24, 2026Updated 2 months ago
- Implementation of the paper "Multi-Agent Exploration via Self-Learning and Social Learning"☆20Dec 7, 2024Updated last year
- Implementation of the paper "Egoism, Utilitarianism and Egalitarianism in Multi-Agent Reinforcement Learning"☆21Aug 17, 2024Updated last year
- ☆11May 19, 2025Updated 10 months ago
- Chunk-based neural machine translation☆17Apr 24, 2022Updated 3 years ago
- ☆37Nov 18, 2025Updated 4 months ago
- ☆22Feb 13, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Feb 21, 2022Updated 4 years ago
- CC07-单元☆29Apr 21, 2021Updated 4 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- Recursive Neural Networks for PyTorch☆31Feb 18, 2020Updated 6 years ago
- adapt data to and from every format☆28Feb 15, 2026Updated last month
- ☆15Oct 9, 2023Updated 2 years ago
- ☆14May 20, 2025Updated 10 months ago