Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.
☆68Oct 2, 2025Updated 5 months ago
Alternatives and similar repositories for kara-audio
Users that are interested in kara-audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implements ML audio separation algorithm on audio from YouTube or Spotify resulting in "stems" for download (e.g. vocals, drums, bass) in…☆37Dec 8, 2025Updated 3 months ago
- ULL (<1s) Live Video Streaming over HTTP CDNs☆13Sep 11, 2024Updated last year
- Automatically create synchronised lyrics files in ASS and LRC with word-level timestamps, using Whisper and lyrics from online sources, w…☆90Jan 19, 2026Updated 2 months ago
- Examples of how to use API of MVSep service☆30Jun 21, 2025Updated 9 months ago
- This repository contains content related to 2D and 3D lane detection, as well as video lane detection. There are not only papers here, bu…☆13Sep 1, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).☆19Apr 1, 2021Updated 4 years ago
- The second generation of VoiceFixer, a toolkit for general speech restoration. *Not affiliated with the original VoiceFixer repo*☆21Nov 19, 2023Updated 2 years ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆107Jan 8, 2026Updated 2 months ago
- Behringer BCD2000 custom firmware to use w/o driver☆13Dec 15, 2023Updated 2 years ago
- An offline CPU-first low-resource chat application to perform RAG on your corpus of data. Powered by OpenChat and CTranslate2.☆14May 14, 2025Updated 10 months ago
- AudioSR-Colab-Fork☆51Oct 12, 2025Updated 5 months ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆27Oct 13, 2025Updated 5 months ago
- extension developers and users can submit working commit versions of webui and the extension if needed.☆38May 4, 2023Updated 2 years ago
- 📁 ○ ○ ○ dotfolders and dotfiles☆17Updated this week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Mycelium API Gateway, the ultimate solution for secure, flexible, and multi-tenant API management☆22Mar 12, 2026Updated 2 weeks ago
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆619Oct 18, 2025Updated 5 months ago
- A visualizer for pixel mapping with realtime DMX output using TouchDesigner.☆12Apr 22, 2020Updated 5 years ago
- Code for a custom, 2-universe, wireless ArtNet to DMX Node☆21Oct 20, 2023Updated 2 years ago
- Singing-Voice Separation From Monaural Recordings Using Deep Recurrent Neural Networks☆64Jun 28, 2018Updated 7 years ago
- Raspberry Pi DMX512 / RDM / MIDI / OSC / Art-Net / WS28xx / TLC59711☆18Apr 3, 2020Updated 5 years ago
- Multi-channel, multi-track, multi-player player for audio files.☆11Feb 7, 2026Updated last month
- An application for detects and displays the pitch of musical notes played on a musical instrument.☆13May 8, 2022Updated 3 years ago
- The Open Source AI Musical Toolkit☆50Mar 19, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ComfyUI nodes for transcription on audio or video input.☆32Apr 23, 2025Updated 11 months ago
- A place to store community presets for the No Man's Sky Base Builder!☆12Jun 6, 2023Updated 2 years ago
- Google collab for testing SoftVC VITS Singing Voice Conversion for AI capable of changing the singer within music files.☆13Apr 21, 2023Updated 2 years ago
- PostgreSQL Logical Replication CDC Module for Streaming Database Changes with Golang☆27Sep 30, 2024Updated last year
- Tuya Zigbee MCU SDK Arduino Library enables interfacing your Arduino with Tuya's network module, helping you build an IoT-enabled project…☆15Sep 4, 2023Updated 2 years ago
- 2019_ML_Course Singing Voice Conversion Using Cycle-GAN:VC2☆17Dec 30, 2020Updated 5 years ago
- Additional non-node based UI for ComfyUI focused on inference. Stable UI states; presets; and advanced queue. Based on Gradio☆119Mar 15, 2026Updated 2 weeks ago
- Low-latency hash map using minimal perfect hash functions and compact encoding.☆38Nov 19, 2024Updated last year
- Compare MIDI with Vocal Pitch☆22Mar 5, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Websocket controlled Video Overlay server for OBS-Studio, XSplit, CasparCG, ProPresenter and everything with web browser.☆27Sep 14, 2024Updated last year
- Hashi is a full-stack social media app built with Flutter, designed to foster connections and create a vibrant online community.☆23Jul 12, 2024Updated last year
- Ask question to your PDF☆10Jun 11, 2023Updated 2 years ago
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆37Aug 7, 2025Updated 7 months ago
- Interface for Touch Remote Application Programming☆16Jan 22, 2024Updated 2 years ago
- Base class and framework for writing modules for modern Bitfocus Companion☆20Updated this week
- using g4f & embedding tools to mock openai server☆12Aug 20, 2023Updated 2 years ago