hbredin / pyannotebook
🎹 pyannote + 📓 notebook = pyannotebook
★26 · Updated last year
Alternatives and similar repositories for pyannotebook:
Users who are interested in pyannotebook are comparing it to the libraries listed below:
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project… ★12 · Updated 2 years ago
- A speech signal processing library in Python with emphasis on deep learning. ★31 · Updated 2 years ago
- ★21 · Updated last month
- Speaker change detection using SincNet and an LSTM/Transformer ★46 · Updated 7 months ago
- Deep Speech Distances PyTorch ★27 · Updated 2 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch ★21 · Updated 2 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need? ★11 · Updated this week
- C++ version of pyannote audio overlapped speech detection pipeline ★11 · Updated last year
- Benchmarking different VAD models on AVA-Speech dataset ★13 · Updated last year
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ★30 · Updated last year
- ★12 · Updated 3 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ★18 · Updated last week
- A collection of papers related to speech model compression ★24 · Updated last year
- Streaming Audiotransformers for online Audio tagging ★43 · Updated 8 months ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation ★74 · Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ★28 · Updated last year
- ★11 · Updated 2 years ago
- Official implementation of Self-Remixing ★13 · Updated last year
- Prosodic Speech Segmentation with Transformers ★25 · Updated 11 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ★34 · Updated 5 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models. ★76 · Updated last year
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis ★23 · Updated 3 years ago
- ★31 · Updated 10 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement ★22 · Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ★19 · Updated 3 months ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks". ★13 · Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator ★44 · Updated 2 weeks ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ★13 · Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ★26 · Updated 6 months ago
- GPT for FACodec ★13 · Updated 10 months ago