BenCaunt / MoondreamObjectTracking
Using the moondream VLM with optical flow for promptable object tracking
☆73Updated 10 months ago
Alternatives and similar repositories for MoondreamObjectTracking
Users who are interested in MoondreamObjectTracking are comparing it to the libraries listed below
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- ☆94Updated last year
- ☆82Updated last year
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!☆79Updated last year
- ☆78Updated last year
- An automated tool for discovering insights from research paper corpora☆138Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆163Updated 5 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated last year
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆126Updated 7 months ago
- ☆107Updated 2 months ago
- ☆30Updated last year
- This repository stores the source code for the Mistral Hackathon 2024 in Paris☆16Updated last year
- Inference, Fine Tuning and many more recipes with Gemma family of models☆277Updated 6 months ago
- The next evolution of Agents☆48Updated this week
- Embed anything.☆27Updated last year
- A threejs / WebGL / MediaPipe-powered interactive demo that allows you to control a 3D sphere using hand gestures.☆133Updated 7 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated last year
- ☆119Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆66Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 6 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆42Updated 3 months ago
- Gradio UI for a Cog API☆71Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated 2 years ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆134Updated last year
- An open-source Discord bot, created using LlamaIndex, that - Listens to your server conversations, continuously learns from them & answe…☆76Updated last year
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆72Updated 4 months ago
- Personal project, Generative AI, Streamlit, Python☆54Updated 8 months ago
- ☆75Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆123Updated last year
- ☆101Updated last year