DoMusic / Hybrid-Net
Real-time audio to chords, lyrics, beat, and melody.
☆713Updated last year
Alternatives and similar repositories for Hybrid-Net
Users who are interested in Hybrid-Net are comparing it to the libraries listed below.
- example free website for client-side music demixing with Demucs + WebAssembly☆352Updated 8 months ago
- A transformer-based network model for pitch detection☆166Updated 5 months ago
- A vocal pitch correction web application (like Autotune)☆321Updated 2 years ago
- YouTube video to chords, lyrics, beat and melody.☆256Updated last year
- a co-creative looper that uses generative modeling to **not** repeat itself.☆288Updated 8 months ago
- This is a python implementation for stitching images.☆233Updated last year
- A novel human-interaction method for real-time speech extraction on headphones.☆595Updated last year
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆883Updated last month
- ☆175Updated 2 months ago
- A diffusion model to colorize black and white images☆784Updated 2 years ago
- ☆154Updated last year
- Super simple MLX (apple silicon) CLIP based photo similarity web app☆493Updated last year
- ☆443Updated last year
- Sonic Sound Picture (SSP) is a free, offline, and customizable music/audio visualizer software. With a range of templates to choose from,…☆241Updated 2 years ago
- The Sol Mate GPT but on your e-Paper display!☆313Updated last week
- OpenCV+YOLO+LLAVA powered video surveillance system☆780Updated 2 months ago
- A lightweight text-to-speech model with zero-shot voice cloning☆598Updated this week
- ☆163Updated last year
- Docker-based inference engine for AMD GPUs☆231Updated last year
- AI Prediction api of the MusicLang package☆293Updated last year
- Polyrhythmically-inclined MIDI drum generator☆303Updated 2 years ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆281Updated 2 weeks ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆785Updated last year
- State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI to…☆414Updated 3 years ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode…☆907Updated last week
- Ctrl-f for videos☆274Updated last year
- Sound Recognition☆297Updated 2 years ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆492Updated 2 years ago
- Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.☆639Updated 10 months ago
- Semantic Image Search CLI tool.☆577Updated last year