video-db / ocr-benchmark
Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
☆34Updated last month
Alternatives and similar repositories for ocr-benchmark:
Users that are interested in ocr-benchmark are comparing it to the libraries listed below
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆87Updated 3 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 5 months ago
- ☆29Updated 4 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 5 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 3 weeks ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 6 months ago
- Using the moondream VLM with optical flow for promptable object tracking☆51Updated last month
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆45Updated last month
- Build AI Agents with Your Existing Python Code!☆56Updated 5 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 8 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 9 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆71Updated 6 months ago
- ☆111Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- Gradio UI for a Cog API☆66Updated 11 months ago
- ☆47Updated 11 months ago
- Build Web Datasets with Ease☆33Updated 9 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 6 months ago
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore☆47Updated this week
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated 6 months ago
- Very minimal (and stateless) agent framework☆41Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- converts url content into JSON with a simple prefix☆67Updated 10 months ago
- All the world is a play, we are but actors in it.☆47Updated this week
- auto fine tune of models with synthetic data☆75Updated last year
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆30Updated 4 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- The next evolution of Agents☆48Updated 2 weeks ago
- ☆51Updated 4 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆31Updated last month