adefossez / seewav
Audio waveform visualisation, converts any audio to a nice video
☆227Updated last year
Alternatives and similar repositories for seewav:
Users who are interested in seewav are comparing it to the libraries listed below.
- BandIt: Cinematic Audio Source Separation☆104Updated 6 months ago
- Faster Tortoise inference than Tortoise Fast Fork☆126Updated 9 months ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆253Updated 3 weeks ago
- A collection of pre-trained audio models, in PyTorch.☆112Updated 2 years ago
- Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)☆188Updated 9 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆130Updated last year
- Your one-stop solution for voice dataset creation☆117Updated last year
- Self-supervised learning for fast pitch estimation☆204Updated last month
- [WIP] VoiceSmith makes training text to speech models easy.☆224Updated 2 years ago
- music generation with masked transformers!☆318Updated this week
- Barkify: an unofficial training implementation of Bark TTS by suno-ai☆126Updated last year
- Pitch Estimating Neural Networks (PENN)☆240Updated 5 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆229Updated 2 weeks ago
- Pytorch implementation of the CREPE pitch tracker☆423Updated 7 months ago
- Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.☆119Updated 3 years ago
- A collection of useful audio datasets and transforms for PyTorch.☆138Updated last year
- PyTorch implementation of Universal Source Separation with Weakly Labelled Data.☆342Updated last year
- Trainer for audio-diffusion-pytorch☆128Updated 2 years ago
- Noise removal/reduction for audio files in Python. De-noising is done using wavelets, and thresholding is done by the VisuShrink threshold…☆179Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆231Updated 7 months ago
- Encode and decode audio samples to/from compressed latent representations!☆172Updated 5 months ago
- Fine-tune your own MusicGen with LoRA☆123Updated 9 months ago
- Performant and accurate speech recognition built on Pytorch☆251Updated 2 years ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆169Updated 4 months ago
- The Nendo AI Audio Tool Suite☆213Updated 9 months ago
- a notebook containing scripts, documentation, and examples for finetuning musicgen☆83Updated 9 months ago
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆116Updated last year
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆155Updated 2 years ago