webaverse / LJSpeechToolsLinks
Tools to isolate speaker and transcribe unstructured audio clips
☆11Updated 3 years ago
Alternatives and similar repositories for LJSpeechTools
Users that are interested in LJSpeechTools are comparing it to the libraries listed below
Sorting:
- ☆20Updated 2 years ago
- ☆20Updated 4 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 3 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 3 years ago
- Finally, some decent sample sentences☆23Updated 2 years ago
- Heteronym to Phoneme Parser☆18Updated 2 years ago
- [DEPRECEATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs☆17Updated 3 years ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆13Updated last year
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Updated last year
- Python scripts I made to make NNSVS labeling easier.☆25Updated 2 years ago
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch☆45Updated 3 years ago
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆15Updated 4 years ago
- Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included…☆40Updated 3 years ago
- Voice swapping with VQ-VAE and diffusion models☆68Updated 4 years ago
- A web app that lets you play around with TalkNet models☆124Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- Real-time end-to-end singing voice convertion☆22Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- 🦒 The world's largest corpus of CC0-1.0 licensed music (public domain music).☆30Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Updated 2 years ago
- A realtime 3d spectrogram visualization of the user's microphone audio. Made with threeJs using shaders.☆53Updated last year
- Song Describer is a data collection platform for annotating music with textual descriptions.☆60Updated last year
- The original weights of some Caffe models, ported to PyTorch.☆11Updated 3 years ago
- ☆51Updated last year
- A fork of sinsy: HMM/DNN-based singing voice synthesis system☆71Updated 3 years ago
- ☆107Updated 2 years ago