Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
☆47Feb 14, 2025Updated last year
Alternatives and similar repositories for ocr-benchmark
Users that are interested in ocr-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Copy clip sucks, so this is a better one made with Expo + Electron.☆11Oct 5, 2023Updated 2 years ago
- Official PyTorch Implementation of "Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching"☆31Mar 1, 2026Updated 2 months ago
- Official implementation of "Unified Diffusion Transformer for High-Fidelity Text-Aware Image Restoration"☆28Dec 22, 2025Updated 5 months ago
- Ayle Chat is a custom-built AI chat application leveraging the power of Groq and Exa Search for unparalleled speed and providing immediat…☆11Dec 15, 2025Updated 5 months ago
- Chrome extension to convert a GitHub repo to text prompt☆14Dec 14, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Sep 13, 2020Updated 5 years ago
- ☆16Jul 9, 2024Updated last year
- A browser script that turns web-based LLMs (ChatGPT, Gemini, Claude, Perplexity) into APIs, allowing you to automate interactions with th…☆13Dec 12, 2025Updated 5 months ago
- This is a python bot script that reads a pdf file, then it opens a navigator window (Google or Firefox) at chatgpt openai website. and he…☆17Dec 16, 2024Updated last year
- A collection of sophisticated computer vision and machine learning problems for graduate-level researchers and practitioners☆40Jun 13, 2025Updated 11 months ago
- Play games right in your X feed☆15May 2, 2025Updated last year
- [ICASSP2024] An official implement of the paper "EFFICIENT SCENE TEXT IMAGE SUPER-RESOLUTION WITH SEMANTIC GUIDANCE"☆24May 12, 2024Updated 2 years ago
- DSPy prompt optimization demo from AI Tinkerers presentation☆19Aug 15, 2025Updated 9 months ago
- ☆35Feb 14, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for our paper "Fixed-point Inversion for Text-to-image diffusion models"☆19Oct 13, 2024Updated last year
- 🚩 FastImage, performant React Native image component.☆14Aug 14, 2018Updated 7 years ago
- Bring your code and propmpts easily to your LLM☆21Jun 10, 2025Updated 11 months ago
- CodeMerge is useful for consolidating code from various files into a single file that can be used as context for AI code generation model…☆24Feb 28, 2026Updated 2 months ago
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- ☆13Oct 12, 2023Updated 2 years ago
- Make machine learning simpler with Galaxy☆12Jul 16, 2024Updated last year
- Build tools for LLMs in Rust using Model Context Protocol☆37Feb 25, 2025Updated last year
- A project to take an audio file and separate it into speakers and play it with avatars and save the recording as an mp4 for sharing on so…☆13Nov 6, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- Official Implementation of "Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation"☆49Jan 29, 2026Updated 3 months ago
- Automation Assistant for UI Task Execution.☆11Jan 3, 2025Updated last year
- ☆19Mar 28, 2022Updated 4 years ago
- AI Music Structure Analyzer + Stem Splitter using Demucs & Mdx-Net with Python-Audio-Separator | Cog | Replicate☆13Mar 3, 2024Updated 2 years ago
- Uses Convex to mirror discord into slack☆17Mar 17, 2026Updated 2 months ago
- PyRex is a Python GUI regular expression tool. Alternative, offline version of regex101☆17Mar 19, 2024Updated 2 years ago
- walterra's collections of helpers for agentic coding☆34Mar 23, 2026Updated 2 months ago
- ☆16May 12, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Collections of Actions for Custom GPTs (some created by Captain Action)☆11Jan 7, 2024Updated 2 years ago
- ☆24Mar 6, 2023Updated 3 years ago
- Fast and Computationally efficient Continual Learning for NanoDet anchor-free Object Detector☆13Dec 16, 2024Updated last year
- Official implementation of "AM-Adapter: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild" (ICCV 2025)☆26Jul 8, 2025Updated 10 months ago
- VideoDB Python SDK☆95May 15, 2026Updated last week
- A Chrome extension that automatically extracts webpage content when sharing URLs with Claude, enabling deeper conversations and better an…☆21Nov 12, 2024Updated last year
- A browser extension that demos Gemini Nano via window.ai and Cartesia TTS ⚡️☆38Jul 10, 2024Updated last year