video-db / ocr-benchmark
Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
☆42Updated 5 months ago
Alternatives and similar repositories for ocr-benchmark
Users who are interested in ocr-benchmark are comparing it to the libraries listed below.
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 9 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 9 months ago
- Gradio UI for a Cog API☆69Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 3 months ago
- ☆53Updated last month
- GRDN.AI app for garden optimization☆70Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 8 months ago
- ☆115Updated 6 months ago
- ☆101Updated last month
- Retrieve the source code for any model made available on replicate.com!☆34Updated last year
- An automated tool for discovering insights from research paper corpora☆138Updated last year
- ☆28Updated 7 months ago
- The next evolution of Agents☆48Updated 2 weeks ago
- Cerule - A Tiny Mighty Vision Model☆66Updated 10 months ago
- ☆47Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆100Updated 6 months ago
- Build AI Agents with Your Existing Python Code!☆61Updated 8 months ago
- ☆89Updated 9 months ago
- Gradio-based tool to run open-source LLMs directly from Hugging Face☆93Updated last year
- Build Web Datasets with Ease☆33Updated last year
- A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition.☆74Updated 5 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated 3 weeks ago
- ☆50Updated last year
- Converts URL content into JSON with a simple prefix☆70Updated last year
- ☆84Updated this week
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆131Updated last month
- Demo of AI chatbot that predicts user message to generate response quickly.☆104Updated last year
- ☆66Updated last year
- This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.☆43Updated 2 weeks ago
- A couple scripts to grab stats from email☆43Updated 10 months ago