dnth / x.infer

Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm, ultralytics, vllm, ollama and your custom model.

☆119

Related projects ⓘ

Alternatives and complementary repositories for x.infer

adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆77Updated 5 months ago
AlexBodner / How_Much_VRAM
☆93Updated 2 months ago
DataformerAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
☆136Updated 3 weeks ago
tonywu71 / colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆165Updated 3 weeks ago
AK391 / dailypapersHN
☆81Updated last month
SanshruthR / CCTV_YOLO
Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826
☆51Updated last month
ai8hyf / OpenResearchAssistant
An automated tool for discovering insights from research papaer corpora
☆135Updated 5 months ago
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆83Updated last month
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆64Updated 4 months ago
multimodalart / grog
Gradio UI for a Cog API
☆64Updated 7 months ago
aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆55Updated 3 months ago
teknium1 / ShareGPT-Builder
☆104Updated 7 months ago
ryderwishart / daily-research-bot
Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…
☆45Updated last month
aigeek0x0 / rag-with-langchain-colbert-and-ragatouille
Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB
☆116Updated 9 months ago
davidberenstein1957 / dataset-viber
Dataset Viber is your chill repo for data collection, annotation and vibe checks.
☆42Updated 2 months ago
yvrjsharma / HugginFace_Gradio
☆67Updated last month
replicate / flux-fine-tuner
Cog wrapper for ostris/ai-toolkit + post-finetuning cog inference for flux models
☆293Updated 3 weeks ago
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆160Updated 6 months ago
illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆126Updated this week
OutofAi / StableFace
Build your own Face App with Stable Diffusion 2.1
☆140Updated last month
SkalskiP / SoM
Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️
☆77Updated last year
cognitivecomputations / kraken
☆64Updated 5 months ago
NousResearch / Obsidian
Maybe the new state of the art vision model? we'll see 🤷‍♂️
☆153Updated 10 months ago
metaswang / bao
Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backed
☆129Updated 6 months ago
agentsea / surfkit
A toolkit for building multimodal AI agents
☆107Updated 2 weeks ago
neulab / Pangea
This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"
☆88Updated last week
edgarGracia / gradio_image_annotator
A Gradio component that can be used to annotate images with bounding boxes.
☆31Updated 2 weeks ago
RayFernando1337 / LLM-Calc
Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models fo…
☆94Updated 2 weeks ago
sumo43 / loopvlm
run paligemma in real time
☆122Updated 5 months ago
ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37Updated last year