An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Ollama). It runs entirely offline to preserve privacy and provides a user-friendly GUI.
☆68 · Mar 15, 2026 · Updated last month
Alternatives and similar repositories for ai-powered-video-analyzer
Users who are interested in ai-powered-video-analyzer are comparing it to the libraries listed below.
- Find and Use Cheats via the PythonGDB API ☆11 · Aug 1, 2016 · Updated 9 years ago
- C++ library for audio and music analysis, description and synthesis, including Python bindings ☆11 · Feb 12, 2026 · Updated 2 months ago
- A ComfyUI extension for generating captions of images. ☆29 · May 12, 2025 · Updated 11 months ago
- A "loopback on steroids" type of extension for Stable Diffusion Web UI. ☆31 · Oct 10, 2025 · Updated 6 months ago
- Local GLM-4 Prompt Enhancer and Inference for ComfyUI ☆30 · Jul 20, 2025 · Updated 8 months ago
- This extension provides inference-time optimization techniques to enhance diffusion-based image generation quality through random search … ☆23 · Feb 27, 2025 · Updated last year
- Chat with your RVC models. See website for demo: ☆22 · Feb 15, 2024 · Updated 2 years ago
- Argument Mining Python implementation ☆13 · Jul 28, 2023 · Updated 2 years ago
- Kotlin extension for VSCode ☆34 · Jun 9, 2018 · Updated 7 years ago
- firmware for Gecho Loopsynth, STM32F4 based pocket synth (http://gechologic.com/) ☆10 · Apr 22, 2019 · Updated 6 years ago
- ☆11 · Jul 6, 2022 · Updated 3 years ago
- Soundscape Ecology Toolkit ☆11 · Mar 25, 2016 · Updated 10 years ago
- files for pihole fail2ban ☆10 · Jun 21, 2019 · Updated 6 years ago
- Real-time streaming data pipeline for Twitter Tweets ☆10 · Jan 31, 2022 · Updated 4 years ago
- Repository for SF2SE3: Clustering Scene Flow into SE(3)-Motions via Proposal and Selection ☆12 · Jul 26, 2024 · Updated last year
- Predicting chicago crime data using network science ☆10 · Aug 26, 2018 · Updated 7 years ago
- A Wavenet generative model in TensorFlow, trained with Western Classical solo piano canon with global and local conditioning ☆11 · Oct 26, 2017 · Updated 8 years ago
- Is the placement of bike-share docks equitable? ☆11 · Mar 15, 2024 · Updated 2 years ago
- Driving Futures is an application that simulates and visualizes parking utilization by shared and autonomous passenger vehicles in hypoth… ☆11 · Nov 19, 2018 · Updated 7 years ago
- A visualization tool for using persistent homology to interact with undirected graphs. ☆10 · Jul 10, 2019 · Updated 6 years ago
- Load and run SDNQ quantized models in ComfyUI with 50-75% VRAM savings! ☆76 · Updated this week
- Master MVA, ENS Cachan, France: 3D Point Cloud Processing. Implementation of the research article "Segmentation Based Classification of 3… ☆15 · Oct 3, 2019 · Updated 6 years ago
- Just aNO(t)DEr nodered for heroku ☆11 · Oct 13, 2022 · Updated 3 years ago
- JavaScript simulation of the classical XY model ☆14 · Sep 12, 2013 · Updated 12 years ago
- ☆15 · May 17, 2022 · Updated 3 years ago
- RoboPaint RT: Processing-based software for the WaterColorBot, to make and print drawings in real time. ☆22 · Oct 3, 2017 · Updated 8 years ago
- MIDI for Arduino ☆20 · Dec 21, 2021 · Updated 4 years ago
- An LSTM implementation for a Rap Lyric Generator that spawns rap lyrics based on a Kaggle dataset with over 38,000 lines. ☆12 · Dec 20, 2018 · Updated 7 years ago
- GliTr Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction ☆25 · Apr 14, 2023 · Updated 3 years ago
- A collection of pix2pix models ☆23 · Sep 1, 2018 · Updated 7 years ago
- RGinger takes an English sentence and gives correction and rephrasing suggestions for it using Ginger proofreading API. ☆17 · May 20, 2020 · Updated 5 years ago
- Convert an audio file to a compressed (gzip) SVG waveform ☆25 · Dec 1, 2017 · Updated 8 years ago
- A multiplayer shape drawing app with colours inspired by Byrne's edition of Euclid ☆24 · Dec 11, 2022 · Updated 3 years ago
- Ultra fast frame interpolation using Rife Tensorrt inside ComfyUI ☆100 · Sep 23, 2025 · Updated 6 months ago
- [WACV 2023] Code for "M-FUSE: Multi-frame Fusion for Scene Flow Estimation" ☆22 · Oct 28, 2022 · Updated 3 years ago
- [ALPHA] Models for Runway ☆33 · Nov 5, 2018 · Updated 7 years ago
- ☆36 · Nov 14, 2024 · Updated last year
- Collection of DCASE related datasets