Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
☆47Feb 14, 2025Updated last year
Alternatives and similar repositories for ocr-benchmark
Users that are interested in ocr-benchmark are comparing it to the libraries listed below.
- A project on object detection using YOLO and object tracking using Kalman filters.☆10Dec 5, 2018Updated 7 years ago
- ☆19Sep 29, 2025Updated 9 months ago
- Official implementation of "Unified Diffusion Transformer for High-Fidelity Text-Aware Image Restoration"☆28Dec 22, 2025Updated 6 months ago
- ☆14Mar 29, 2026Updated 3 months ago
- ☆19Mar 25, 2025Updated last year
- ☆16Apr 7, 2024Updated 2 years ago
- Encryption-based file system (based on the ext2 hierarchical file system, with inodes as metadata)☆22Mar 12, 2019Updated 7 years ago
- Instantly create video clips from LLM prompts☆175Aug 22, 2024Updated last year
- ☆21Jun 21, 2024Updated 2 years ago
- Build your Team's CLI☆17Jun 22, 2026Updated last week
- re-implementation of instantsplat (unofficial)☆16Aug 5, 2024Updated last year
- DSPy prompt optimization demo from AI Tinkerers presentation☆19Aug 15, 2025Updated 10 months ago
- Official Implementation of "Multi-Granularity Video Object Segmentation" (AAAI 2025)☆25Dec 20, 2024Updated last year
- Bring your code and prompts easily to your LLM☆21Jun 10, 2025Updated last year
- Implementing Meta AI's VGGT as a FiftyOne Remote Zoo Model☆20Jun 20, 2025Updated last year
- Flutter App capable of classifying 400 different species of birds using MobileNetV2. Upload an image from Google Drive or your phone gall…☆13Dec 10, 2022Updated 3 years ago
- Building self-refined guardrails via DSPy☆14Jun 25, 2026Updated last week
- Official implementation of "InterRVOS: Interaction-aware Referring Video Object Segmentation".☆31May 1, 2026Updated 2 months ago
- ☆13Oct 12, 2023Updated 2 years ago
- Make machine learning simpler with Galaxy☆12Jul 16, 2024Updated last year
- Build tools for LLMs in Rust using Model Context Protocol☆37Feb 25, 2025Updated last year
- This code is published for skyline detection☆31Mar 19, 2026Updated 3 months ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- CLV prediction with pareto-NBD model☆12Jul 1, 2016Updated 10 years ago
- Official Implementation of "Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation"☆50Jan 29, 2026Updated 5 months ago
- Automation Assistant for UI Task Execution.☆11Jan 3, 2025Updated last year
- An AI agents framework addressing the two core challenges with real-world agents: optimisation and deployment☆14Apr 3, 2024Updated 2 years ago
- A FUSE implementation in Rust for Git objects☆14Aug 25, 2016Updated 9 years ago
- a basic wrapper for Rate My Professors's GraphQL API☆16Oct 31, 2023Updated 2 years ago
- AI Music Structure Analyzer + Stem Splitter using Demucs & Mdx-Net with Python-Audio-Separator | Cog | Replicate☆14Mar 3, 2024Updated 2 years ago
- Code to scrape CVPR website for list of accepted papers, find their arXiv links, extract metadata, and download pdfs☆10Jun 12, 2024Updated 2 years ago
- [ISPRS] Plug-and-Play DISep: Separating Dense Instances for Scene-to-Pixel Weakly-Supervised Change Detection in High-Resolution Remote S…☆22Dec 15, 2025Updated 6 months ago
- ☆16Jun 1, 2026Updated last month
- A RAG retrieval application that adapts to its specific user and topic, so that it's purpose-built every time.☆16Mar 18, 2024Updated 2 years ago
- Official implementation of Adapt3R: Adaptive 3D Scene Representation for Domain Transfer in Imitation Learning☆52Jun 10, 2026Updated 3 weeks ago
- VideoDB Python SDK☆96Jun 26, 2026Updated last week
- Steganography is data hidden within data. Steganography is an encryption technique that can be used along with cryptography as an extra-s…☆22Mar 16, 2017Updated 9 years ago
- Albumentations Data Augmentation Plugin for FiftyOne!☆15Aug 22, 2024Updated last year
- Crawl a GitHub repo's files to create a customized GitHub GPT assistant☆16Dec 20, 2023Updated 2 years ago