roboflow / cookbooksLinks
Templates for computer vision projects, referenced in Roboflow blog posts.
☆19Updated last year
Alternatives and similar repositories for cookbooks
Users that are interested in cookbooks are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- Example showing how to do inference on a video file with Roboflow Infer☆48Updated last year
- Simple CogVLM client script☆14Updated last year
- Flask-based web application designed to compare text and image embeddings using the CLIP model.☆22Updated last year
- examples and guides to using Nomic Atlas☆38Updated 5 months ago
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆135Updated 3 weeks ago
- A repository for creating, and sample code for consuming an ONNX embedding model☆33Updated 2 years ago
- Convert an audio file to a waveform video☆11Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆13Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆89Updated last week
- A template repo holding our common setup for a python project☆120Updated 2 years ago
- repo for versioning snippets that show how to use Roboflow APIs☆20Updated last year
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆14Updated 3 years ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- ☆14Updated last year
- This Repo focuses on defending against 'adversarial prompts,' detecting and attempting to mitigate objectionable content in real time.☆12Updated 2 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆37Updated 2 weeks ago
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag…☆117Updated 2 years ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year
- Chat to Compose Video☆195Updated last year
- ☆40Updated 9 months ago
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆29Updated 2 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated 11 months ago
- Speak (speech-to-text) to LLMs (Ollama) in any lanaguage - Streamlit app☆47Updated last year
- The purpose of this repository is to discuss on Audio transformers☆13Updated last month
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated last week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- This repo lets you run mistral-7b in Google Colab.☆16Updated last year