video-db / ocr-benchmarkLinks
Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
☆44Updated 7 months ago
Alternatives and similar repositories for ocr-benchmark
Users that are interested in ocr-benchmark are comparing it to the libraries listed below
Sorting:
- Useful resources for LLM-based Diarization and Transcription.☆54Updated 11 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 9 months ago
- An automated tool for discovering insights from research papaer corpora☆139Updated last year
- ☆116Updated 9 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 11 months ago
- Build AI Agents with Your Existing Python Code!☆67Updated 11 months ago
- ☆104Updated 3 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 3 months ago
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆146Updated 2 weeks ago
- ☆46Updated last year
- ☆55Updated last month
- A couple scripts to grab stats from email☆43Updated last year
- auto fine tune of models with synthetic data☆75Updated last year
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆272Updated last year
- 🐮📢 The first AI voice assistant that interrupts *you*☆149Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆79Updated last year
- The next evolution of Agents☆47Updated this week
- ☆88Updated this week
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆134Updated 4 months ago
- Gradio UI for a Cog API☆69Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆95Updated last year
- ☆30Updated 10 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 8 months ago
- Embed anything.☆27Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated last year
- ☆22Updated 4 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Updated 11 months ago
- Build Web Datasets with Ease☆33Updated last year
- converts url content into JSON with a simple prefix☆71Updated last year