saisyam / pywav
Reading and Writing .WAV files in Python
☆19 Updated 5 years ago
Alternatives and similar repositories for pywav
Users who are interested in pywav are comparing it to the libraries listed below.
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆102 Updated 4 months ago
- Predicts the level of noise and reverberation on your audiofiles ☆152 Updated last week
- Fast and high quality sample-rate conversion library for Python ☆95 Updated 3 weeks ago
- ☆40 Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆84 Updated last year
- Tunable pipelines ☆34 Updated 4 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 Updated 4 months ago
- ☆40 Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated 3 weeks ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆94 Updated 5 months ago
- SelfRemaster: SSL Speech Restoration ☆89 Updated last year
- Advanced data structures for handling temporal segments with attached labels. ☆113 Updated 4 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 Updated 2 years ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆138 Updated 3 months ago
- ☆56 Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆141 Updated 3 weeks ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis ☆135 Updated 5 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆132 Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆165 Updated 2 weeks ago
- ☆86 Updated 8 months ago
- Clustering-based methods for overlapping diarization ☆80 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆52 Updated last month
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition. ☆43 Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆99 Updated 8 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆115 Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆63 Updated 2 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆64 Updated 2 years ago
- Python Wrapper of Silero VAD ☆55 Updated last month
- Putting flows on top of neural transducers for better TTS ☆62 Updated this week
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆97 Updated 11 months ago