roboflow / inference
Turn any computer or edge device into a command center for your computer vision projects.
☆1,657 Updated this week
Alternatives and similar repositories for inference:
Users who are interested in inference are comparing it to the libraries listed below.
- Images to inference with no labeling (use foundation models to train supervised models). ☆2,249 Updated last month
- Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,557 Updated this week
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer… ☆385 Updated last week
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection ☆5,366 Updated 2 months ago
- RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning. ☆2,015 Updated 2 weeks ago
- This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural R… ☆1,901 Updated last year
- [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy ☆2,468 Updated 2 weeks ago
- An open-source computer vision framework to build and deploy apps in minutes ☆751 Updated last year
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything ☆1,268 Updated last week
- YOLOE: Real-Time Seeing Anything ☆1,183 Updated this week
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series ☆943 Updated 3 months ago
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥 ☆1,680 Updated 3 months ago
- 4M: Massively Multimodal Masked Modeling ☆1,719 Updated 2 months ago
- CoTracker is a model for tracking any point (pixel) on a video. ☆4,291 Updated 3 months ago
- Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets ☆268 Updated 6 months ago
- Data Labeling, Tracking and Annotation with AI ☆344 Updated 11 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo] ☆717 Updated 10 months ago
- Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS. ☆4,749 Updated 7 months ago
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection ☆3,239 Updated 5 months ago
- Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than… ☆1,127 Updated 4 months ago
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2 ☆2,072 Updated 2 weeks ago
- ☆706 Updated last year
- This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinf… ☆918 Updated 5 months ago
- tiny vision language model ☆7,911 Updated 3 weeks ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,237 Updated this week
- MetaSeg: Packaged version of the Segment Anything repository ☆979 Updated this week
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert… ☆1,430 Updated last month
- Segment Anything Labelling Tool ☆1,038 Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ☆1,598 Updated 9 months ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything ☆2,329 Updated 4 months ago