BenCaunt / MoondreamObjectTrackingLinks
Using the moondream VLM with optical flow for promptable object tracking
☆71Updated 6 months ago
Alternatives and similar repositories for MoondreamObjectTracking
Users that are interested in MoondreamObjectTracking are comparing it to the libraries listed below
Sorting:
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- ☆78Updated 9 months ago
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!☆79Updated 10 months ago
- An automated tool for discovering insights from research papaer corpora☆139Updated last year
- ☆96Updated 9 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆124Updated 3 months ago
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆161Updated last month
- ☆80Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆75Updated last year
- Code examples showing how to use Gemini, Gemma, Imagen, and more.☆43Updated 5 months ago
- ☆104Updated 3 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated last year
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated 10 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆266Updated 2 months ago
- The next evolution of Agents☆47Updated this week
- ☆116Updated 9 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- Take your LLM to the optometrist.☆40Updated last month
- Gradio UI for a Cog API☆69Updated last year
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆36Updated 9 months ago
- ☆102Updated last year
- Convert PowerPoint files into semantically rich text using vision language models☆102Updated 6 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆109Updated 6 months ago
- Pipecat voice AI agents running locally on macOS☆275Updated 3 weeks ago
- Jockey is a conversational video agent.☆89Updated 3 months ago
- ☆28Updated last year
- A threejs / WebGL / MediaPipe-powered interactive demo that allows you to control a 3D sphere using hand gestures.☆107Updated 3 months ago
- ☆107Updated 7 months ago
- Personal project, Generative AI, Streamlit, Python☆54Updated 4 months ago
- How to use bounding boxes with the Gemini API☆105Updated last year