saisyam / pywav
Reading and Writing .WAV files in Python
☆18Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for pywav
- Tunable pipelines☆29Updated 3 weeks ago
- Create an LJSpeech structured voice dataset on wave input☆19Updated last month
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆233Updated 2 weeks ago
- Unofficial implementation of miipher☆111Updated 6 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆129Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆61Updated last year
- A Hackable speech recognition library.☆25Updated 3 weeks ago
- Predicts the level of noise and reverberation on your audiofiles☆138Updated 5 months ago
- ☆78Updated last month
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆70Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆25Updated last year
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆147Updated 2 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆99Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆83Updated 3 weeks ago
- VoiceBox neural network implementation☆96Updated 3 months ago
- ONNX Inference of Pyannote Segmentation☆65Updated 2 months ago
- Python Wrapper of Silero VAD☆41Updated 2 weeks ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆125Updated 2 weeks ago
- Pytorch implementation of BigVSAN☆198Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated this week
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆111Updated 3 months ago
- Advanced data structures for handling temporal segments with attached labels.☆98Updated 4 months ago
- An unofficial PyTorch implementation of VALL-E☆75Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- This is the audio sample repository for speech separation model "MossFormer2".☆105Updated 7 months ago
- ☆32Updated 9 months ago
- Python bindings around the LAME encoder☆51Updated 2 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆84Updated 3 weeks ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆140Updated 2 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆128Updated 9 months ago