video-db / ocr-benchmark
Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
☆37Updated 2 months ago
Alternatives and similar repositories for ocr-benchmark:
Users that are interested in ocr-benchmark are comparing it to the libraries listed below
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 6 months ago
- Gradio UI for a Cog API☆67Updated last year
- ☆51Updated 5 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 weeks ago
- Using the moondream VLM with optical flow for promptable object tracking☆53Updated 2 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆126Updated 7 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated 6 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 6 months ago
- A couple scripts to grab stats from email☆42Updated 7 months ago
- Build AI Agents with Your Existing Python Code!☆56Updated 5 months ago
- auto fine tune of models with synthetic data☆75Updated last year
- Build Web Datasets with Ease☆33Updated 10 months ago
- ☆79Updated last week
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 7 months ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆22Updated 11 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆91Updated 4 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- LoRA Explorer model to test with LoRAs using Flux.1[Dev] as the base model☆46Updated 6 months ago
- ☆29Updated 4 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- converts url content into JSON with a simple prefix☆68Updated 11 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆71Updated 7 months ago
- A high performance batching router optimises max throughput for text inference workload☆16Updated last year
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore☆57Updated this week
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 5 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 9 months ago
- ☆17Updated 4 months ago
- ☆22Updated 9 months ago
- Refactor your code with local LLM in VSCode☆13Updated last year