tobiasrordorf / SRT-to-CSV-and-audio-split
Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)
☆18Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for SRT-to-CSV-and-audio-split
- Non Parallel Voice Conversion based on VITS☆23Updated last year
- 'Grad-TTS' with Multilingual Cleaners☆10Updated 7 months ago
- singing voice conversion based on glow-tts☆11Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- ☆28Updated 11 months ago
- ☆39Updated last year
- A simple voice conversion tool☆15Updated 2 years ago
- Workflow for forced alignment between languages☆17Updated 8 months ago
- Finally, some decent sample sentences☆22Updated 11 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆46Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆44Updated last year
- List of repositories relevant to VITS.☆35Updated last year
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆28Updated 2 years ago
- Heteronym to Phoneme Parser☆15Updated last year
- ☆20Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆56Updated last year
- Finetuning VITS Efficiently☆32Updated last year
- Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)☆3Updated 11 months ago
- ☆30Updated last year
- ☆24Updated 4 months ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 8 months ago
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated last year
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated last year
- Sovits5 with RMVPE☆14Updated last year
- ☆22Updated last year
- TransferTTS (Zero-Shot learning of VITS)☆89Updated 2 years ago
- Monotonic Alignment Search☆86Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 3 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated 2 years ago