hbredin / pyannotebook
πΉ pyannote + π notebook = pyannotebook
β26Updated last year
Alternatives and similar repositories for pyannotebook:
Users that are interested in pyannotebook are comparing it to the libraries listed below
- β23Updated this week
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Updated last year
- Speaker change detection using SincNet and an LSTM/Transformerβ48Updated 8 months ago
- A speech signal processing library in Python with emphasis on deep learning.β31Updated 2 years ago
- Prosodic Speech Segmentation with Transformersβ25Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.β82Updated last year
- β20Updated 6 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?β11Updated 2 weeks ago
- Benchmarking different VAD models on AVA-Speech datasetβ14Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.β20Updated last week
- Clustering-based methods for overlapping diarizationβ77Updated last year
- C++ version of pyannote audio overlapped speech detection pipelineβ12Updated last year
- Paderbox: A collection of utilities for audio / speech processingβ38Updated 3 weeks ago
- Constrained Permutation Invariant Training, Speech Separationβ47Updated 4 years ago
- GPT for FACodecβ13Updated 11 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networksβ20Updated 2 weeks ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSIβ¦β21Updated 6 months ago
- β31Updated 11 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.β76Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamicβ¦β46Updated 5 months ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Modelβ32Updated last year
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/projectβ¦β12Updated 2 years ago
- Deep Speech Distances PyTorchβ27Updated 3 years ago
- Viterbi decoding in PyTorchβ28Updated 3 weeks ago
- Multipurpose Multi Speaker Mixture Signal Generatorβ44Updated last month
- Implementation of the DIVA model of speech acquisition and production using PyTorchβ21Updated 2 years ago
- A collection of papers related to speech model compressionβ23Updated last year
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".β13Updated 2 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesisβ23Updated 3 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"β13Updated 2 years ago