saisyam / pywav
Reading and Writing .WAV files in Python
☆19 Updated 6 years ago
Alternatives and similar repositories for pywav
Users that are interested in pywav are comparing it to the libraries listed below
- On-device noise suppression powered by deep learning ☆75 Updated 2 months ago
- A curated list of awesome voice activity detection ☆67 Updated 11 months ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices. ☆34 Updated 6 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 Updated 2 years ago
- Advanced data structures for handling temporal segments with attached labels. ☆121 Updated last month
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆34 Updated 5 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆107 Updated last month
- ☆91 Updated last year
- Fast and high quality sample-rate conversion library for Python ☆104 Updated 2 weeks ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆132 Updated 2 years ago
- Handling audio files in Python ☆38 Updated 2 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆55 Updated 5 months ago
- On-device voice activity detection (VAD) powered by deep learning ☆232 Updated last month
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆177 Updated 2 weeks ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated 2 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for… ☆175 Updated last year
- Model for recasing and repunctuating ASR transcripts ☆141 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆68 Updated last week
- C++ version of pyannote audio speaker diarization pipeline ☆21 Updated last year
- Python bindings around the LAME encoder ☆62 Updated 9 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆90 Updated 2 years ago
- Predicts the level of noise and reverberation in your audio files ☆166 Updated 4 months ago
- ☆43 Updated last year
- Tunable pipelines ☆40 Updated last month
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆147 Updated 4 months ago
- ONNX Inference of Pyannote Segmentation ☆95 Updated 10 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 Updated 2 years ago
- SelfRemaster: SSL Speech Restoration ☆90 Updated last year
- Putting flows on top of neural transducers for better TTS ☆64 Updated 2 weeks ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification. ☆59 Updated 10 months ago