Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
☆47Feb 14, 2025Updated last year
Alternatives and similar repositories for ocr-benchmark
Users that are interested in ocr-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An open-source agent toolkit that auto-syncs SDK versions, docs, and examples—built for seamless integration with LLMs, and AI agents ( M…☆47Updated this week
- MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models, sloppily ported to cog/replicate☆12Apr 25, 2023Updated 2 years ago
- A blog where I write about research papers and blog posts I read.☆12Nov 20, 2024Updated last year
- A collection of noise designs for diffusion models [Eurographics Tutorial / SIGGRAPH Course, 2025]☆28May 26, 2025Updated 10 months ago
- ☆10Apr 16, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆19Sep 29, 2025Updated 6 months ago
- Official implementation of "Unified Diffusion Transformer for High-Fidelity Text-Aware Image Restoration"☆27Dec 22, 2025Updated 3 months ago
- ☆14May 18, 2025Updated 10 months ago
- ☆19Mar 25, 2025Updated last year
- ☆16Apr 7, 2024Updated last year
- Ayle Chat is a custom-built AI chat application leveraging the power of Groq and Exa Search for unparalleled speed and providing immediat…☆11Dec 15, 2025Updated 3 months ago
- Personal website *always* under construction☆15Apr 13, 2025Updated 11 months ago
- Python package providing functionality and plotting for chemistry method comparison☆16Feb 28, 2024Updated 2 years ago
- ☆11Aug 23, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Instantly create video clips from LLM prompts☆171Aug 22, 2024Updated last year
- Compile simple Shadertoys into small .COM MS-DOS executables☆10May 13, 2024Updated last year
- ☆20Jun 21, 2024Updated last year
- Play games right in your X feed☆15May 2, 2025Updated 10 months ago
- Build your Team's CLI☆17Mar 23, 2026Updated last week
- re-implementation of instantsplat (unofficial)☆16Aug 5, 2024Updated last year
- Large-scale remote sensing image target recognition and automatic annotation☆12Nov 23, 2024Updated last year
- A system prompt approach enabling Large Language Models (LLMs) to perform post-response reasoning without additional training☆17Feb 13, 2025Updated last year
- Python Template for Competitive Programming☆13Jun 8, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICASSP2024] An official implement of the paper "EFFICIENT SCENE TEXT IMAGE SUPER-RESOLUTION WITH SEMANTIC GUIDANCE"☆25May 12, 2024Updated last year
- Code for our paper "Fixed-point Inversion for Text-to-image diffusion models"☆19Oct 13, 2024Updated last year
- Bookdown for container☆11Nov 11, 2024Updated last year
- [ACM MM 2025] LIDAR: Lightweight Adaptive Cue-Aware Fusion Vision Mamba for Multimodal Segmentation of Structural Cracks☆22Nov 18, 2025Updated 4 months ago
- Implemeting Meta AI's VGGT as a FiftyOne Remote Zoo Model☆20Jun 20, 2025Updated 9 months ago
- Flutter App capable of classifying 400 different species of birds using MobileNetV2. Upload an image from Google Drive or your phone gall…☆12Dec 10, 2022Updated 3 years ago
- The official repository of the WACV2024 paper "Scene Text Image Super-resolution based on Text-conditional Diffusion Models"☆22Jan 15, 2024Updated 2 years ago
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.☆30Dec 27, 2025Updated 3 months ago
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- CodeMerge is useful for consolidating code from various files into a single file that can be used as context for AI code generation model…☆24Feb 28, 2026Updated last month
- ☆13Oct 12, 2023Updated 2 years ago
- Build tools for LLMs in Rust using Model Context Protocol☆37Feb 25, 2025Updated last year
- A project to take an audio file and separate it into speakers and play it with avatars and save the recording as an mp4 for sharing on so…☆12Nov 6, 2024Updated last year
- This code is published for skyline detection☆26Mar 19, 2026Updated last week
- CLV prediction with pareto-NBD model☆12Jul 1, 2016Updated 9 years ago
- Official Implementation of "Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation"☆49Jan 29, 2026Updated 2 months ago