webaverse / LJSpeechTools
Tools to isolate speaker and transcribe unstructured audio clips
☆11 Updated 2 years ago
Alternatives and similar repositories for LJSpeechTools:
Users who are interested in LJSpeechTools are comparing it to the repositories listed below
- ☆20 Updated 3 years ago
- Streamlit app to visualize and edit TTS datasets ☆14 Updated 3 years ago
- Generate LoRA Data from Blender Renders and more! ☆12 Updated last year
- The original weights of some Caffe models, ported to PyTorch. ☆11 Updated 3 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart) ☆25 Updated 2 years ago
- ☆21 Updated 2 years ago
- List of Podcast Feeds using iTunes API and script to download ~6,000,000 hours of English speech. ☆10 Updated last year
- Blender Keyframe Exporter for AI Animation ☆13 Updated 2 years ago
- NeMo: a toolkit for conversational AI ☆10 Updated 2 years ago
- General Repo for blender and stable-diffusion integration ☆75 Updated 2 years ago
- Repo for storing the files I use to make animations with big-sleep, deep-daze, and VQGAN + CLIP. ☆16 Updated 3 years ago
- Windows compatible code for the paper "Jukebox: A Generative Model for Music" ☆13 Updated 2 years ago
- Finally, some decent sample sentences ☆22 Updated last year
- Lyra V2 (SoundStream) running in the browser ☆19 Updated last year
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch ☆46 Updated 2 years ago
- Jupyter/Colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix ☆38 Updated 2 years ago
- Make-A-Video Latent Diffusion Model ☆18 Updated last year
- NeMo: a toolkit for conversational AI ☆9 Updated last week
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature… ☆11 Updated 6 months ago
- Unofficial implementation of Neural Analysis and Synthesis ☆7 Updated 3 years ago
- A quick test using a Stable Diffusion server and Godot 4 ☆11 Updated 2 years ago
- Heteronym to Phoneme Parser ☆18 Updated last year
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version ☆26 Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆16 Updated 11 months ago
- GradioUI for TortoiseTTS voice generation ☆34 Updated last year
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP ☆39 Updated 2 years ago
- [DEPRECATED] Symbolic MIDI Music AI implementation ☆19 Updated 2 years ago
- Update: Ignore this repo, check out @lucidrains' implementation https://github.com/lucidrains/musiclm-pytorch ☆15 Updated 2 years ago
- A fast MP3 decoder for python, using minimp3 ☆28 Updated 2 years ago
- Algorithmic Intelligence Symbolic Music Paulstretch Augmentator and Generator ☆11 Updated 3 years ago