roboflow / roboflow-collect
Passively collect images for computer vision datasets on the edge.
☆31Updated last year
Alternatives and similar repositories for roboflow-collect:
Users that are interested in roboflow-collect are comparing it to the libraries listed below
- ☆14Updated last year
- MetaCLIP module for use with Autodistill.☆21Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆35Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆62Updated 7 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆68Updated this week
- ☆46Updated last year
- Notebooks using the Neural Magic libraries 📓☆41Updated 7 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 9 months ago
- Flask-based web application designed to compare text and image embeddings using the CLIP model.☆22Updated last year
- ☆15Updated 10 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆11Updated 7 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated 2 months ago
- GPT-4V(ision) module for use with Autodistill.☆26Updated 7 months ago
- Tools for merging pretrained large language models.☆19Updated 9 months ago
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag…☆112Updated last year
- Label your images using GPT-4!☆17Updated last year
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆126Updated 2 weeks ago
- An ONNX-based implementation of the CLIP model that doesn't depend on torch or torchvision.☆66Updated 8 months ago
- Web Interface for Vision Language Models Including InternVLM2☆19Updated 7 months ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆17Updated 5 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆22Updated 4 months ago
- Integrate an LLM copilot within your Keras model development workflow☆28Updated last year
- ☆29Updated last year