tobiasrordorf / SRT-to-CSV-and-audio-split
Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)
☆18 · Updated 5 years ago
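The description above (SRT cues in, CSV transcript out) can be illustrated with a minimal sketch. This is not the repository's code: the function names `parse_srt` and `cues_to_csv`, the CSV column names, and the sample subtitles are all assumptions made for illustration. It uses only the Python standard library and stops at producing the cue table; the resulting start/end times in milliseconds are what an audio-splitting step would cut on.

```python
import csv
import io
import re

# Matches an SRT timing line such as "00:00:01,000 --> 00:00:04,250".
TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*(\d{2}):(\d{2}):(\d{2})[,.](\d{3})"
)


def to_ms(h, m, s, ms):
    """Convert hour/minute/second/millisecond strings to total milliseconds."""
    return ((int(h) * 60 + int(m)) * 60 + int(s)) * 1000 + int(ms)


def parse_srt(text):
    """Return a list of (start_ms, end_ms, caption) tuples (hypothetical helper)."""
    cues = []
    # Cues are separated by blank lines.
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        for i, line in enumerate(lines):
            match = TIMING.search(line)
            if match:
                g = match.groups()
                # Everything after the timing line is caption text.
                caption = " ".join(lines[i + 1:])
                cues.append((to_ms(*g[:4]), to_ms(*g[4:]), caption))
                break
    return cues


def cues_to_csv(cues):
    """Serialize cues to CSV text; the column names are an assumption."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["start_ms", "end_ms", "text"])
    writer.writerows(cues)
    return buf.getvalue()


# Made-up sample subtitles for demonstration only.
SAMPLE = """1
00:00:01,000 --> 00:00:04,250
Hello there.

2
00:00:05,500 --> 00:00:07,000
Second line,
continued.
"""

if __name__ == "__main__":
    print(cues_to_csv(parse_srt(SAMPLE)))
```

Each CSV row then describes one audio segment: slicing the source audio from `start_ms` to `end_ms` with any audio library would yield a clip paired with its transcript text.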
Related projects
Alternatives and complementary repositories for SRT-to-CSV-and-audio-split
- The YouTube Text-To-Speech dataset comprises waveform audio extracted from YouTube videos alongside their English transcriptions ☆50 · Updated 3 years ago
- 4G GPU and 10 minutes to train ☆12 · Updated last year
- TTS model based on VITS, FastSpeech2, and VISinger ☆23 · Updated last year
- Ultimate Vocal Remover Inference CLI ☆51 · Updated last year
- Text to speech is an emerging area of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆27 · Updated last year
- 56 languages, 1 model: multilingual ASR ☆24 · Updated 3 years ago
- Use VITS and Opencpop to develop singing voice synthesis; different from VISinger. ☆32 · Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Updated last year
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning. ☆29 · Updated 2 years ago
- A minimum inference engine for DiffSinger ☆34 · Updated 7 months ago
- Finally, some decent sample sentences ☆22 · Updated 11 months ago
- ☆32 · Updated 2 months ago
- Singing voice conversion without f0 ☆22 · Updated last year
- ☆28 · Updated last year
- Non Parallel Voice Conversion based on VITS ☆23 · Updated last year
- ☆10 · Updated 3 months ago
- RTVC: Real-Time Voice Conversion GUI ☆51 · Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆46 · Updated last year
- ☆14 · Updated last year
- Just another FastSpeech 2 but cleaner code :) ☆25 · Updated 4 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 · Updated last year
- Fine-Tune Whisper with Transformers and PEFT ☆38 · Updated last year
- Text To Speech Multilingual Support (20+ languages) ☆35 · Updated last year
- RVC ONNX Infer: upgraded and simplified-ish ☆19 · Updated 6 months ago
- Singing voice conversion based on FreeVC ☆21 · Updated last year
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake… ☆56 · Updated last year
- My vocoder experiments ☆21 · Updated last month