webaverse / LJSpeechTools
Tools to isolate speaker and transcribe unstructured audio clips
☆11Updated 2 years ago
Alternatives and similar repositories for LJSpeechTools:
Users who are interested in LJSpeechTools are comparing it to the libraries listed below.
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- A quick test using a Stable Diffusion server and Godot 4☆11Updated last year
- text-to-audio-latent-diffusion☆37Updated last year
- [DEPRECATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs☆17Updated 2 years ago
- Repo for storing the files I use to make animations with big-sleep, deep-daze, and VQGAN + CLIP.☆16Updated 3 years ago
- Hifi-like Vocoder implemented in PyTorch☆13Updated 2 years ago
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch☆46Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆14Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- A library for editing and rendering motion of 3D characters with deep learning.☆10Updated last year
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated last year
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Updated 3 years ago
- Unity WebGL template for Hugging Face Spaces☆14Updated 3 years ago
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Updated 2 years ago
- [DEPRECATED] Symbolic MIDI Music AI implementation☆19Updated 2 years ago
- Make-A-Video Latent Diffusion Model☆18Updated last year
- Blender Keyframe Exporter for AI Animation☆13Updated 2 years ago
- [DEPRECATED] Morpheus Music AI implementation spin-off :)☆17Updated 2 years ago
- Tracking the state of the art and recent results (bibliography) on sound tasks.☆32Updated 2 years ago
- Finally, some decent sample sentences☆22Updated last year
- [DEPRECATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆31Updated last year
- NeMo: a toolkit for conversational AI☆9Updated last week
- Stable Diffusion WebUI server forked with extra features - designed for Seth's AI Tools client☆42Updated this week
- NeMo: a toolkit for conversational AI☆10Updated 2 years ago