adefossez / seewav
Audio waveform visualisation, converts any audio to a nice video
☆233 Updated last month
Alternatives and similar repositories for seewav:
Users who are interested in seewav are comparing it to the libraries listed below.
- BandIt: Cinematic Audio Source Separation ☆115 Updated 9 months ago
- The Nendo AI Audio Tool Suite ☆213 Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription ☆147 Updated 10 months ago
- python wrapper for rubberband ☆186 Updated 6 months ago
- Fine-tune your own MusicGen with LoRA ☆132 Updated last year
- Object-oriented handling of audio data, with GPU-powered augmentations, and more. ☆272 Updated 3 weeks ago
- Trainer for audio-diffusion-pytorch ☆129 Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch. ☆113 Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings. ☆117 Updated 4 months ago
- A collection of useful audio datasets and transforms for PyTorch. ☆139 Updated 2 years ago
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili… ☆172 Updated 9 months ago
- Your one-stop solution for voice dataset creation ☆119 Updated last year
- Encode and decode audio samples to/from compressed latent representations! ☆199 Updated 2 months ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al. ☆214 Updated last year
- PyTorch implementation of the CREPE pitch tracker ☆436 Updated 10 months ago
- ☆165 Updated last year
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data. ☆347 Updated last year
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/ ☆384 Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 Updated last year
- Faster Tortoise inference than Tortoise Fast Fork ☆128 Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆95 Updated 6 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆132 Updated last year
- Self-supervised learning for fast pitch estimation ☆217 Updated 2 months ago
- Audio datasets, easier. ☆84 Updated last year
- An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track) ☆166 Updated last year
- music generation with masked transformers! ☆325 Updated this week
- Pre-trained model and script to automatically align lyrics to polyphonic audio ☆107 Updated 4 years ago
- Chord conditioning implemented MusicGen ☆56 Updated last year
- A simple library for Fréchet Audio Distance (FAD) calculation ☆202 Updated 2 weeks ago
- Community framework for training tortoise ☆41 Updated 2 years ago