roboflow / inference
Turn any computer or edge device into a command center for your computer vision projects.
☆1,443Updated this week
Alternatives and similar repositories for inference:
Users that are interested in inference are comparing it to the libraries listed below
- Images to inference with no labeling (use foundation models to train supervised models).☆2,058Updated last month
- streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL☆1,427Updated this week
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computer…☆338Updated this week
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection☆4,942Updated 2 months ago
- Whereabouts Ascertainment for Low-lying Detectable Objects. The SOTA in FOSS AI for drones!☆1,368Updated last week
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆2,752Updated last week
- CoTracker is a model for tracking any point (pixel) on a video.☆4,069Updated 3 weeks ago
- This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural R…☆1,850Updated last year
- Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything☆1,152Updated 2 months ago
- ☆1,265Updated last year
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥☆1,667Updated this week
- [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy☆2,360Updated 2 months ago
- This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinf…☆792Updated last month
- An open-source computer vision framework to build and deploy apps in minutes☆732Updated 8 months ago
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]☆596Updated 10 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆685Updated 6 months ago
- Segment Anything Labelling Tool☆1,028Updated 10 months ago
- An MIT rewrite of YOLOv9☆912Updated 2 weeks ago
- A high-level programming language for using computer vision.☆343Updated 9 months ago
- Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets☆258Updated 2 months ago
- High-resolution models for human tasks.☆4,763Updated last month
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,237Updated 3 weeks ago
- 4M: Massively Multimodal Masked Modeling☆1,666Updated 3 months ago
- YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]☆10,280Updated 3 months ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,446Updated 5 months ago
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.☆2,705Updated this week
- Segment Anything in High Quality [NeurIPS 2023]☆3,774Updated last month
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,256Updated 9 months ago
- A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT☆696Updated last year
- Tracking Anything in High Quality☆746Updated last year