DoMusic / Hybrid-NetLinks
Real-time audio to chords, lyrics, beat, and melody.
☆706Updated last year
Alternatives and similar repositories for Hybrid-Net
Users that are interested in Hybrid-Net are comparing it to the libraries listed below
Sorting:
- example free website for client-side music demixing with Demucs + WebAssembly☆348Updated 5 months ago
- A transformer-based network model for pitch detection☆165Updated 2 months ago
- YouTube video to chords, lyrics, beat and melody.☆256Updated last year
- A vocal pitch correction web application (like Autotune)☆316Updated 2 years ago
- a co-creative looper that uses generative modeling to **not** repeat itself.☆287Updated 5 months ago
- A novel human-interaction method for real-time speech extraction on headphones.☆584Updated last year
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆867Updated 6 months ago
- OpenCV+YOLO+LLAVA powered video surveillance system☆776Updated last week
- A diffusion model to colorize black and white images☆779Updated 2 years ago
- Polyrhythmically-inclinded Midi Drum generator☆297Updated 2 years ago
- This is a python implementation for stitching images.☆233Updated last year
- ☆155Updated 11 months ago
- ☆442Updated 10 months ago
- Sonic Sound Picture (SSP) is a free, offline, and customizable music/audio visualizer software. With a range of templates to choose from,…☆239Updated 2 years ago
- The Sol Mate GPT but on your e-Paper display!☆314Updated 11 months ago
- Pytorch based speech enhancement toolkit.☆337Updated last year
- ☆163Updated last year
- ☆153Updated last week
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode…☆897Updated last year
- turnkey self-hosted offline transcription and diarization service with llm summary☆890Updated last year
- Super simple MLX (apple silicon) CLIP based photo similarity web app☆489Updated last year
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆785Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆494Updated last year
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆280Updated last month
- ☆58Updated 2 years ago
- Semantic Image Search CLI tool.☆563Updated last year
- Homemade automated solar concentrator 🔧 ☀️ 🔎☆356Updated last year
- Docker-based inference engine for AMD GPUs☆230Updated 11 months ago
- AI Prediction api of the MusicLang package☆289Updated last year
- An advanced automation framework for audio mixer consoles, OBS, PTZ cameras and more based on the Open Sound Control protocol.☆112Updated last year