BenCaunt / MoondreamObjectTracking
Using the moondream VLM with optical flow for promptable object tracking
☆51Updated last month
Alternatives and similar repositories for MoondreamObjectTracking:
Users that are interested in MoondreamObjectTracking are comparing it to the libraries listed below
- ☆29Updated 3 months ago
- Gradio UI for a Cog API☆66Updated 11 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 5 months ago
- A Model Context Protocol (MCP) server for interacting with fal.ai models and services.☆32Updated last week
- A discord bot to stay up to date with Hugging Face Daily Papers.☆14Updated 11 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆28Updated 2 months ago
- ☆91Updated 3 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 6 months ago
- The next evolution of Agents☆48Updated last week
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆45Updated 3 weeks ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Open-source AI for voice control, rivaling Alexa and Siri☆12Updated last year
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 6 months ago
- ☆111Updated 3 months ago
- A couple scripts to grab stats from email☆42Updated 6 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 5 months ago
- ☆22Updated 5 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 9 months ago
- ☆75Updated 3 months ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆73Updated last year
- ☆38Updated 6 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 9 months ago
- ☆41Updated 10 months ago
- Multi-person podcast audio to videocast☆10Updated 6 months ago
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!☆80Updated 5 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆32Updated 3 months ago
- ☆77Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated 5 months ago
- ☆85Updated 2 months ago