roboflow / inference
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
☆1,270Updated this week
Related projects: ⓘ
- Images to inference with no labeling (use foundation models to train supervised models).☆1,851Updated last week
- streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, Phi-3.5 Vision☆1,285Updated this week
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥☆1,627Updated 6 months ago
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer…☆280Updated this week
- An open-source computer vision framework to build and deploy apps in minutes☆705Updated 4 months ago
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection☆4,360Updated last month
- This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural R…☆1,810Updated 9 months ago
- [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy☆2,156Updated 3 weeks ago
- ☆1,251Updated 10 months ago
- 4M: Massively Multimodal Masked Modeling☆1,543Updated 2 months ago
- CoTracker is a model for tracking any point (pixel) on a video.☆2,679Updated last month
- Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than…☆1,019Updated 3 months ago
- A modern model graph visualizer and debugger☆984Updated this week
- PyTorch code and models for V-JEPA self-supervised learning from video.☆2,614Updated last month
- tiny vision language model☆4,893Updated 3 weeks ago
- On-device AI across mobile, embedded and edge for PyTorch☆1,698Updated this week
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,235Updated 5 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆616Updated last week
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]☆564Updated 6 months ago
- Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything☆952Updated this week
- Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.☆2,096Updated this week
- ☆694Updated 6 months ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,081Updated 3 months ago
- Tracking Anything in High Quality☆743Updated 9 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code]☆634Updated 2 months ago
- A high-level programming language for using computer vision.☆342Updated 5 months ago
- Whereabouts Ascertainment for Low-lying Detectable Objects. The SOTA in FOSS AI for drones!☆530Updated 5 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆630Updated 2 months ago
- Inference Llama 2 in one file of pure 🔥☆2,091Updated 3 months ago
- VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and…☆1,786Updated last week