BenCaunt / MoondreamObjectTrackingLinks
Using the moondream VLM with optical flow for promptable object tracking
☆57Updated 4 months ago
Alternatives and similar repositories for MoondreamObjectTracking
Users that are interested in MoondreamObjectTracking are comparing it to the libraries listed below
Sorting:
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Gradio UI for a Cog API☆68Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- ☆114Updated 6 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated this week
- ☆28Updated 6 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆80Updated last month
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 8 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆34Updated 6 months ago
- All the world is a play, we are but actors in it.☆50Updated this week
- This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.☆38Updated last week
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated 7 months ago
- ☆28Updated last year
- ☆21Updated 7 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆43Updated 5 months ago
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆30Updated 8 months ago
- ☆22Updated 8 months ago
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!☆79Updated 7 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆86Updated last year
- ☆95Updated 6 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆108Updated 4 months ago
- The next evolution of Agents☆48Updated this week
- Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini☆58Updated last month
- SmolVLM2 Demo☆154Updated 3 months ago
- ☆21Updated 3 weeks ago
- Vanilla-Python ergonomics on top of DSPy☆31Updated 3 weeks ago
- Open-source AI for voice control, rivaling Alexa and Siri☆12Updated last year