ardha27 / WhisperX-Youtube-SRTLinks
Create Youtube SRT with WhisperX using Google Colab
☆20Updated 2 years ago
Alternatives and similar repositories for WhisperX-Youtube-SRT
Users that are interested in WhisperX-Youtube-SRT are comparing it to the libraries listed below
Sorting:
- Ultimate Vocal Remover CLI type for Google Colab☆58Updated 2 months ago
- Colaboratory Notebook for Ultimate Vocal Remover☆96Updated 11 months ago
- 数据集自动化制作脚本☆72Updated 2 years ago
- Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.☆117Updated last year
- RVC Inference with multiple model and huggingface support☆106Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆68Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆246Updated last year
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆20Updated 2 months ago
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆33Updated last year
- Ultimate Vocal Remover Inference CLI☆81Updated 5 months ago
- ☆145Updated 5 months ago
- singing voice conversion based on glow-tts☆11Updated last year
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆128Updated 3 months ago
- API for a Vocal Remover that uses Deep Neural Networks.☆113Updated last year
- Ultimate Vocal Remover CLI☆149Updated 5 months ago
- Pipelines and tools to build your own DiffSinger dataset.☆112Updated 3 months ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆148Updated last year
- RTVC: Real-Time Voice Conversion GUI☆56Updated last year
- RVC Onnx Infer- Upgraded and simplified-ish☆21Updated last year
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆55Updated last year
- 基于vits fastspeech2 visinger的tts模型☆24Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆23Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆153Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆49Updated 3 months ago
- Cantonese Text to Speech with VITS implementation☆31Updated 2 years ago
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Updated last year
- A Japanese G2P tool based on pyopenjtalk☆25Updated 2 years ago
- extract and isolate vocals from media files. supports multispeaker media as well.☆46Updated last year
- Sovits5 with RMVPE☆14Updated last year