yhenon / llm-face-visionLinks
Benchmarking vision language vision on face tasks
☆16Updated 8 months ago
Alternatives and similar repositories for llm-face-vision
Users that are interested in llm-face-vision are comparing it to the libraries listed below
Sorting:
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 11 months ago
- Evaluation framework for document processing models and services.☆59Updated this week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆23Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- ☆51Updated last year
- This repository contains a Multimodal Retrieval-Augmented Generation (RAG) Pipeline that integrates images, audio, and text for advanced …☆24Updated 11 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 4 months ago
- ☆62Updated 5 months ago
- ☆59Updated last year
- Supporting code for: Video Enriched Retrieval Augmented Generation Using Aligned Video Captions☆32Updated last year
- Visual RAG using less than 300 lines of code.☆29Updated last year
- ☆27Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 2 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆97Updated last year
- ☆101Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- code for training and using chess embeddings models☆13Updated last year
- tickr-agent is an enterprise-ready, scalable Python library for building swarms of financial agents that conduct comprehensive stock anal…☆51Updated 2 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- ☆70Updated last year
- ☆55Updated last year
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- unsloth-5090-multiple☆60Updated 7 months ago
- Effective frame sampling for ML applications.☆24Updated 3 months ago
- A collection of reproducible inference engine benchmarks☆38Updated 7 months ago
- ☆21Updated last year
- Modified Beam Search with periodical restart☆12Updated last year
- Tools for merging pretrained large language models.☆19Updated last year