EtienneAb3d / karaok-AI
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
β63Updated 11 months ago
Related projects β
Alternatives and complementary repositories for karaok-AI
- The BEST music separation model with help of A.I. ... to my ears ! ππβ129Updated 5 months ago
- β70Updated last year
- Model for CDX23 (Cinematic Sound Demixing) contestβ38Updated 4 months ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlpβ17Updated last month
- BandIt: Cinematic Audio Source Separationβ94Updated 4 months ago
- Versatile AI-driven audio upscaler to enhance the quality of any audio.β60Updated 2 months ago
- β28Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ132Updated 5 months ago
- Chord conditioning implemented MusicGenβ46Updated 7 months ago
- Ultimate Vocal Remover CLI type for Google Colabβ45Updated 3 weeks ago
- Community framework for training tortoiseβ38Updated 2 years ago
- extract and isolate vocals from media files. supports multispeaker media as well.β41Updated 11 months ago
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction projectβ33Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning β¦β24Updated last year
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 fileβ¦β34Updated 2 months ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.β32Updated last year
- A testing repo to share code and thoughts on diarisationβ53Updated 7 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" modelsβ65Updated 2 years ago
- Implements ML audio separation algorithm on audio from YouTube or Spotify resulting in "stems" for download (e.g. vocals, drums, bass) inβ¦β25Updated 2 months ago
- A Gradio setup for Tortoise TTS.β45Updated last year
- Automatic lyrics alignment at phoneme or word level with a pre-trained deep neural network.β29Updated last year
- Pack cuda environment for bytesep music separation and provide a simple gui.β34Updated 2 years ago
- Text prompt steered synthetic audio generatorsβ45Updated 11 months ago
- generate granular word-level captions in srt formatβ57Updated 2 years ago
- AudioSR-Colab-Forkβ26Updated last month
- Real-time end-to-end singing voice convertionβ18Updated 2 weeks ago
- β93Updated 3 months ago
- A fast MP3 decoder for python, using minimp3β26Updated 2 years ago
- a notebook containing scripts, documentation, and examples for finetuning musicgenβ75Updated 7 months ago
- π¬ "Realtime" voice transcription and cloning using ElevenLabs's API.β49Updated last year