pytorch / audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆2,582Updated this week
Alternatives and similar repositories for audio:
Users that are interested in audio are comparing it to the libraries listed below
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,925Updated last month
- Audio processing by using pytorch 1D convolution network☆1,042Updated 11 months ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,374Updated 2 years ago
- The PyTorch-based audio source separation toolkit for researchers☆2,311Updated this week
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,310Updated 2 weeks ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…☆1,714Updated last year
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆983Updated this week
- A Python wrapper for Kaldi☆1,006Updated 5 months ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,146Updated 3 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,584Updated 8 months ago
- End-to-End Speech Processing Toolkit☆8,686Updated this week
- Tools for handling speech data in machine learning projects.☆972Updated 3 weeks ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆1,787Updated 7 months ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆774Updated last week
- SoundFile is an audio library based on libsndfile, CFFI, and NumPy☆728Updated last week
- Python interface to the WebRTC Voice Activity Detector☆2,120Updated 6 months ago
- A Flow-based Generative Network for Speech Synthesis☆2,300Updated last year
- ☆1,393Updated 5 months ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆989Updated last year
- The Implementation of FastSpeech based on pytorch.☆862Updated last year
- A PyTorch-based Speech Toolkit☆9,189Updated this week
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…☆953Updated this week
- OpenL3: Open-source deep audio and image embeddings☆483Updated last year
- Collection of audio-focused loss functions in PyTorch☆756Updated 5 months ago
- Speech Recognition using DeepSpeech2.☆2,116Updated 2 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,298Updated 7 months ago
- An implementation of WaveNet with fast generation☆981Updated 4 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,196Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,015Updated 5 months ago
- A Python wrapper for the high-quality vocoder "World"☆736Updated last year