webaverse / LJSpeechToolsLinks
Tools to isolate speaker and transcribe unstructured audio clips
☆11Updated 2 years ago
Alternatives and similar repositories for LJSpeechTools
Users that are interested in LJSpeechTools are comparing it to the libraries listed below
Sorting:
- ☆20Updated 3 years ago
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)☆25Updated 2 years ago
- Repo for storing the files I use to make animations with big-sleep, deep-daze, and VQGAN + CLIP.☆16Updated 3 years ago
- ☆21Updated 2 years ago
- A fast MP3 decoder for python, using minimp3☆29Updated 2 years ago
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch☆45Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆11Updated 2 years ago
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Updated 2 years ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- Visual search interface☆11Updated 3 years ago
- Unity WebGL template for Hugging Face Spaces☆14Updated 3 years ago
- text-to-audio-latent-diffusion☆37Updated last year
- NeMo: a toolkit for conversational AI☆9Updated this week
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 11 months ago
- Finally, some decent sample sentences☆23Updated last year
- Artistic Radiance Fields☆14Updated 3 years ago
- Karras et al. (2022) diffusion models for PyTorch☆12Updated 2 years ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆12Updated 9 months ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix☆38Updated 2 years ago
- ☆35Updated 2 years ago
- Blender Keyframe Exporter for AI Animation☆13Updated 2 years ago
- Make-A-Video Latent Diffusion Model☆18Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- A simple voice conversion tool☆17Updated 3 years ago
- Another ENUNU for enthusiasts and developers, easy to catch up with NNSVS☆13Updated 2 months ago
- Score- and Lyrics-Free Singing Voice Generation☆28Updated 5 years ago
- A fast, local neural text to speech system☆14Updated 4 months ago
- The original weights of some Caffe models, ported to PyTorch.☆11Updated 3 years ago