hbredin / pyannotebook
πΉ pyannote + π notebook = pyannotebook
β26Updated last year
Alternatives and similar repositories for pyannotebook:
Users that are interested in pyannotebook are comparing it to the libraries listed below
- A speech signal processing library in Python with emphasis on deep learning.β31Updated 2 years ago
- Benchmarking different VAD models on AVA-Speech datasetβ11Updated last year
- Speaker change detection using SincNet and an LSTM/Transformerβ46Updated 7 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.β78Updated last year
- β18Updated 3 weeks ago
- β31Updated 9 months ago
- Clustering-based methods for overlapping diarizationβ74Updated last year
- Constrained Permutation Invariant Training, Speech Separationβ46Updated 4 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/projectβ¦β12Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.β30Updated last year
- A Python implementation of the Speech Intelligibility Indexβ41Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ94Updated 2 weeks ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.β44Updated 4 months ago
- Prosodic Speech Segmentation with Transformersβ25Updated 11 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive wayβ40Updated last year
- Streaming Audiotransformers for online Audio taggingβ43Updated 7 months ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approachβ72Updated 9 months ago
- Implementation of the DIVA model of speech acquisition and production using PyTorchβ21Updated 2 years ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based β¦β109Updated last week
- GPT for FACodecβ13Updated 10 months ago
- Repository for "Training Audio Captioning Models without Audio"β9Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Updated last year
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancementβ39Updated 5 months ago
- Multipurpose Multi Speaker Mixture Signal Generatorβ44Updated 3 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.β76Updated last year
- Inference code for PaSST, using the HEAR API.β31Updated last year
- β29Updated 6 months ago
- β56Updated 3 years ago
- Feature extractor for DL speech processing.β65Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'β92Updated 6 months ago