hbredin / pyannotebook
πΉ pyannote + π notebook = pyannotebook
β26Updated last year
Alternatives and similar repositories for pyannotebook:
Users that are interested in pyannotebook are comparing it to the libraries listed below
- A speech signal processing library in Python with emphasis on deep learning.β31Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.β81Updated last year
- Benchmarking different VAD models on AVA-Speech datasetβ14Updated last year
- β27Updated 3 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformerβ50Updated 9 months ago
- β31Updated last year
- Deep Speech Distances PyTorchβ27Updated 3 years ago
- This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purpβ¦β65Updated 4 years ago
- Tunable pipelinesβ32Updated last month
- Prosodic Speech Segmentation with Transformersβ25Updated last year
- Clustering-based methods for overlapping diarizationβ80Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?β11Updated last month
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamicβ¦β47Updated 6 months ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/projectβ¦β12Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.β76Updated last year
- Constrained Permutation Invariant Training, Speech Separationβ47Updated 4 years ago
- Multipurpose Multi Speaker Mixture Signal Generatorβ44Updated 2 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddingsβ28Updated 6 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Updated last year
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generationβ74Updated 3 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networksβ19Updated 2 weeks ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.β26Updated 8 months ago
- β15Updated 2 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separationβ12Updated 8 months ago
- Implementation of vocoders empowered with pytorch lightningβ17Updated last year
- Audio-visual diarization pipeline used for creating VoxConverse datasetβ20Updated last month
- β17Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ100Updated 2 months ago
- Paderbox: A collection of utilities for audio / speech processingβ38Updated last month
- β56Updated 2 years ago