webaverse / LJSpeechToolsLinks
Tools to isolate speaker and transcribe unstructured audio clips
☆11Updated 2 years ago
Alternatives and similar repositories for LJSpeechTools
Users that are interested in LJSpeechTools are comparing it to the libraries listed below
Sorting:
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- A fast MP3 decoder for python, using minimp3☆29Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- ☆20Updated 3 years ago
- Repo for storing the files I use to make animations with big-sleep, deep-daze, and VQGAN + CLIP.☆16Updated 3 years ago
- Update: Ignore this repo, check out @lucidrains' implementation https://github.com/lucidrains/musiclm-pytorch☆15Updated 2 years ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆12Updated 9 months ago
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch☆45Updated 2 years ago
- An library for editing and rendering motion of 3D characters with deep learning.☆10Updated last year
- Heteronym to Phoneme Parser☆18Updated last year
- text-to-audio-latent-diffusion☆37Updated last year
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 11 months ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- Code to reproduce the experiments presented in the article "Data-Efficient Playlist Captioning With Musical and Linguistic Knowledge" (EM…☆18Updated 2 years ago
- ☆21Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- [DEPRECIATED] Symbolic MIDI Music AI implementation☆19Updated 3 years ago
- ☆18Updated 2 years ago
- The original weights of some Caffe models, ported to PyTorch.☆11Updated 3 years ago
- Real-time end-to-end singing voice convertion☆22Updated 8 months ago
- NeMo: a toolkit for conversational AI☆9Updated 3 weeks ago
- ☆35Updated 3 years ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix☆38Updated 2 years ago
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Updated 2 years ago
- High fidelity music synthesis using diffusion and UnivNet.☆9Updated last year
- Mix between music tracks using machine learning☆11Updated last year
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Updated last year
- [DEPRECEATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs☆17Updated 2 years ago
- ☆107Updated last year