webaverse / LJSpeechTools
Tools to isolate speaker and transcribe unstructured audio clips
☆11Updated 2 years ago
Alternatives and similar repositories for LJSpeechTools:
Users that are interested in LJSpeechTools are comparing it to the libraries listed below
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch☆46Updated 2 years ago
- ☆20Updated 3 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- ☆21Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Real-time end-to-end singing voice convertion☆21Updated 5 months ago
- Lyra V2 (SoundStream) running in the browser☆20Updated last year
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆10Updated 2 years ago
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- The original weights of some Caffe models, ported to PyTorch.☆11Updated 3 years ago
- text-to-audio-latent-diffusion☆37Updated last year
- NeMo: a toolkit for conversational AI☆9Updated this week
- Heteronym to Phoneme Parser☆18Updated last year
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆11Updated 7 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Updated last year
- Generate LoRA Data from Blender Renders and more!☆12Updated last year
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Updated 2 years ago
- ☆8Updated 8 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 9 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆13Updated 2 weeks ago
- ☆8Updated 2 years ago
- A quick test using a Stable Diffusion server and Godot 4☆11Updated 2 years ago
- The demo page of UniAudio☆33Updated last year
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆12Updated 4 months ago
- Finally, some decent sample sentences☆22Updated last year
- ☆41Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 3 months ago
- Production-ready vocoder using BigVSAN☆11Updated last year