BenCaunt / MoondreamObjectTracking
Using the moondream VLM with optical flow for promptable object tracking
☆72Updated 10 months ago
Alternatives and similar repositories for MoondreamObjectTracking
Users who are interested in MoondreamObjectTracking are comparing it to the libraries listed below
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- ☆94Updated last year
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!☆79Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆162Updated 4 months ago
- ☆83Updated last year
- ☆107Updated 2 months ago
- The next evolution of Agents☆48Updated last week
- ☆78Updated last year
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated last year
- An automated tool for discovering insights from research paper corpora☆138Updated last year
- ☆30Updated last year
- An open-source Discord bot, created using LlamaIndex, that - Listens to your server conversations, continuously learns from them & answe…☆76Updated last year
- ☆118Updated last year
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆125Updated 6 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated 2 years ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆42Updated 2 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆108Updated 10 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated last year
- A threejs / WebGL / MediaPipe-powered interactive demo that allows you to control a 3D sphere using hand gestures.☆126Updated 7 months ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆135Updated last year
- All the world is a play, we are but actors in it.☆49Updated 5 months ago
- This repository contains a fork of "language-models-trajectory-generators"; the goal is to test the same functionality with Mistral's LL…☆21Updated last year
- Gradio UI for a Cog API☆71Updated last year
- Command-line script for inferencing from models such as WizardCoder☆25Updated 2 years ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Updated 2 years ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 6 months ago
- Transcribe and summarize videos using Whisper and LLMs on Apple's MLX framework☆76Updated last year
- Using vision models to play a "Pictionary"-style draw-and-guess game.☆43Updated last year
- ☆22Updated 7 months ago
- This repository stores the source code for the Mistral Hackathon 2024 in Paris☆16Updated last year