tobiasrordorf / SRT-to-CSV-and-audio-split
Split long audio files based on subtitle info in an SRT file (transcript saved in CSV)
☆20Updated 5 years ago
Alternatives and similar repositories for SRT-to-CSV-and-audio-split:
Users who are interested in SRT-to-CSV-and-audio-split are comparing it to the libraries listed below:
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 7 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- Cantonese Text to Speech with VITS implementation☆20Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆68Updated last year
- Vocal Remover using Deep Neural Networks☆17Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆94Updated 5 months ago
- A simple voice conversion tool☆17Updated 3 years ago
- a Frontier Japanese Speech Generation net☆27Updated last week
- Finetuning VITS Efficiently☆32Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆47Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆20Updated last week
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.☆29Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆25Updated last month
- Workflow for forced alignment between languages☆18Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- MFA acoustic model training based on Opencpop☆14Updated 2 years ago
- Sovits5 with RMVPE☆14Updated last year
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization☆25Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆73Updated last year
- Putting flows on top of neural transducers for better TTS☆62Updated 2 weeks ago
- TTS model based on VITS, FastSpeech2, and VISinger☆23Updated 2 years ago
- TransferTTS (Zero-Shot learning of VITS)☆95Updated 2 years ago