BenCaunt / MoondreamObjectTracking
Using the moondream VLM with optical flow for promptable object tracking
☆68 Updated 5 months ago
Alternatives and similar repositories for MoondreamObjectTracking
Users that are interested in MoondreamObjectTracking are comparing it to the libraries listed below
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆82 Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers ☆158 Updated 3 months ago
- ☆80 Updated last year
- ☆77 Updated 7 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V… ☆119 Updated 2 months ago
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization! ☆79 Updated 9 months ago
- ☆115 Updated 7 months ago
- ☆102 Updated last month
- ☆95 Updated 7 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆47 Updated last month
- Pipecat voice AI agents running locally on macOS ☆88 Updated last week
- The next evolution of Agents ☆48 Updated 2 weeks ago
- Useful resources for LLM-based Diarization and Transcription. ☆55 Updated 9 months ago
- An automated tool for discovering insights from research paper corpora ☆138 Updated last year
- ☆108 Updated 6 months ago
- This repository stores the source code for the Mistral Hackathon 2024 in Paris ☆16 Updated 11 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆262 Updated 2 weeks ago
- An open-source Discord bot, created using LlamaIndex, that - Listens to your server conversations, continuously learns from them & answe… ☆77 Updated last year
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re… ☆46 Updated 8 months ago
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio ☆73 Updated last month
- ☆22 Updated 2 months ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework ☆75 Updated last year
- Personal project, Generative AI, Streamlit, Python ☆54 Updated 3 months ago
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon ☆271 Updated 11 months ago
- GRDN.AI app for garden optimization ☆70 Updated last year
- A threejs / WebGL / MediaPipe-powered interactive demo that allows you to control a 3D sphere using hand gestures. ☆104 Updated 2 months ago
- Convert PowerPoint files into semantically rich text using vision language models ☆100 Updated 5 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents ☆109 Updated 5 months ago
- ☆107 Updated 4 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit ☆36 Updated 8 months ago