webaverse / LJSpeechTools
Tools to isolate speaker and transcribe unstructured audio clips
☆11Updated 2 years ago
Alternatives and similar repositories for LJSpeechTools
Users that are interested in LJSpeechTools are comparing it to the libraries listed below
Sorting:
- ☆20Updated 3 years ago
- ☆21Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- ☆35Updated 2 years ago
- Repo for storing the files I use to make animations with big-sleep, deep-daze, and VQGAN + CLIP.☆16Updated 3 years ago
- Finally, some decent sample sentences☆22Updated last year
- Lyra V2 (SoundStream) running in the browser☆20Updated last year
- An library for editing and rendering motion of 3D characters with deep learning.☆10Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 9 months ago
- text-to-audio-latent-diffusion☆37Updated last year
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆11Updated 7 months ago
- Python C extension for the eSpeak speech synthesizer☆11Updated 4 years ago
- NeMo: a toolkit for conversational AI☆9Updated this week
- Heteronym to Phoneme Parser☆18Updated last year
- A quick test using a Stable Diffusion server and Godot 4☆11Updated 2 years ago
- [DEPRECIATED] Symbolic MIDI Music AI implementation☆19Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- ☆10Updated 6 months ago
- ☆8Updated 2 years ago
- Skybox previewer and generator using BlockadeLabs☆15Updated 2 years ago
- The original weights of some Caffe models, ported to PyTorch.☆11Updated 3 years ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix☆38Updated 2 years ago
- ☆23Updated last year
- A simple voice conversion tool☆17Updated 3 years ago
- Contrastive Language-Audio Pretraining