webaverse / LJSpeechToolsLinks
Tools to isolate speaker and transcribe unstructured audio clips
☆11Updated 3 years ago
Alternatives and similar repositories for LJSpeechTools
Users that are interested in LJSpeechTools are comparing it to the libraries listed below
Sorting:
- ☆20Updated 2 years ago
- ☆20Updated 4 years ago
- Heteronym to Phoneme Parser☆19Updated 2 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 3 years ago
- Real-time end-to-end singing voice convertion☆23Updated last year
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆14Updated last year
- Streamlit app to visualize and edit TTS datasets☆15Updated 4 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 3 years ago
- multimodal probabilistic autoregressive models☆19Updated 2 years ago
- Demo for 2022 Interspeech☆29Updated 3 years ago
- Finally, some decent sample sentences☆23Updated 2 years ago
- Voice swapping with VQ-VAE and diffusion models☆68Updated 4 years ago
- A web app that lets you play around with TalkNet models☆124Updated 2 years ago
- Algorithmic Intelligence Symbolic Music Paulstrech Augmentator and Generator☆11Updated 4 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Updated last year
- [DEPRECIATED] Symbolic MIDI Music AI implementation☆20Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- Deep learning toolkit for image, video, and audio synthesis☆107Updated 3 years ago
- Repo for storing the files I use to make animations with big-sleep, deep-daze, and VQGAN + CLIP.☆16Updated 4 years ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- [DEPRECEATED] A miniature replica of OpenAI's MuseNet☆16Updated 3 years ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆27Updated 2 years ago
- CLIP and PASTE: Using AI to Create Photo Collages from Text Prompts☆29Updated 3 years ago
- Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included…☆40Updated 3 years ago
- Python scripts I made to make NNSVS labeling easier.☆27Updated 2 years ago
- Implementation of SampleRNN for generating novel ambient music from raw audio source material☆10Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 3 years ago
- A quick test using a Stable Diffusion server and Godot 4☆11Updated 2 years ago