tobiasrordorf / SRT-to-CSV-and-audio-split
Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)
☆20Updated 5 years ago
Alternatives and similar repositories for SRT-to-CSV-and-audio-split:
Users that are interested in SRT-to-CSV-and-audio-split are comparing it to the libraries listed below
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆16Updated last year
- This repository contains code for fine-tuning the Whisper speech-to-text model.☆8Updated 2 months ago
- A simple voice conversion tool☆17Updated 3 years ago
- High quality text-to-speech based on StyleTTS 2.☆36Updated this week
- RVC Onnx Infer- Upgraded and simplified-ish☆21Updated 11 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆26Updated last week
- a Frontier Japanese Speech Generation net☆31Updated last month
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 10 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 5 months ago
- ☆67Updated last year
- ☆29Updated last year
- ☆13Updated 8 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- Zero-Shot Emotion Style Transfer☆45Updated this week
- Official Code for ParrotTTS☆48Updated 6 months ago
- ☆13Updated last week
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆69Updated 6 months ago
- 基于vits fastspeech2 visinger的tts模型☆23Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- Finally, some decent sample sentences☆22Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- ☆27Updated 3 weeks ago
- My vocoder experiments☆28Updated 6 months ago
- Collection of scripts from mHuBERT-147.☆24Updated 5 months ago
- ☆12Updated 2 years ago
- Ultimate Vocal Remover Inference CLI☆66Updated 2 months ago