BenCaunt / MoondreamObjectTrackingLinks
Using the moondream VLM with optical flow for promptable object tracking
☆73Updated 11 months ago
Alternatives and similar repositories for MoondreamObjectTracking
Users that are interested in MoondreamObjectTracking are comparing it to the libraries listed below
Sorting:
- ☆94Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆165Updated 6 months ago
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!☆79Updated last year
- ☆82Updated last year
- ☆80Updated last year
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated last year
- The next evolution of Agents☆48Updated last week
- An automated tool for discovering insights from research papaer corpora☆137Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆66Updated last year
- A threejs / WebGL / MediaPipe-powered interactive demo that allows you to control a 3D sphere using hand gestures.☆137Updated 8 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆129Updated 8 months ago
- All the world is a play, we are but actors in it.☆49Updated 6 months ago
- Gradio UI for a Cog API☆70Updated last year
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆42Updated 3 months ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆77Updated 2 years ago
- Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments☆46Updated 11 months ago
- ☆30Updated last year
- How to use bounding boxes with the Gemini API☆106Updated last year
- ☆119Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated 2 years ago
- ☆107Updated 3 months ago
- GRDN.AI app for garden optimization☆69Updated 2 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 7 months ago
- ☆108Updated last year
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated last year
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated last year
- Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini☆68Updated 8 months ago
- Starter app for creating an AI task completion agent with gmail capabilities.☆27Updated last year