dnth / x.infer
Framework-agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm, ultralytics, vllm, ollama, and your own custom models.
☆119 Updated this week
Related projects
Alternatives and complementary repositories for x.infer
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆77 Updated 5 months ago
- ☆93 Updated 2 months ago
- Solving data for LLMs - Create quality synthetic datasets! ☆136 Updated 3 weeks ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆165 Updated 3 weeks ago
- ☆81 Updated last month
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 ☆51 Updated last month
- An automated tool for discovering insights from research paper corpora ☆135 Updated 5 months ago
- Routing on Random Forest (RoRF) ☆83 Updated last month
- Video+code lecture on building nanoGPT from scratch ☆64 Updated 4 months ago
- Gradio UI for a Cog API ☆64 Updated 7 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ☆55 Updated 3 months ago
- ☆104 Updated 7 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re… ☆45 Updated last month
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ☆116 Updated 9 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆42 Updated 2 months ago
- ☆67 Updated last month
- Cog wrapper for ostris/ai-toolkit + post-finetuning cog inference for flux models ☆293 Updated 3 weeks ago
- From scratch implementation of a vision language model in pure PyTorch ☆160 Updated 6 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆126 Updated this week
- Build your own Face App with Stable Diffusion 2.1 ☆140 Updated last month
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆77 Updated last year
- ☆64 Updated 5 months ago
- Maybe the new state of the art vision model? we'll see 🤷‍♂️ ☆153 Updated 10 months ago
- Chat Bot with LLM and Fact Reference. RAG (Retrieval Augmented Generation) and LangChain backed ☆129 Updated 6 months ago
- A toolkit for building multimodal AI agents ☆107 Updated 2 weeks ago
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES" ☆88 Updated last week
- A Gradio component that can be used to annotate images with bounding boxes. ☆31 Updated 2 weeks ago
- Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models fo… ☆94 Updated 2 weeks ago
- run paligemma in real time ☆122 Updated 5 months ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training ☆37 Updated last year