hbredin / pyannotebookView external linksLinks
πΉ pyannote + π notebook = pyannotebook
β26Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for pyannotebook
Users that are interested in pyannotebook are comparing it to the libraries listed below
Sorting:
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)β15Oct 22, 2025Updated 3 months ago
- β14Jun 12, 2015Updated 10 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phonemeβ¦β23Aug 14, 2025Updated 6 months ago
- Learnable STRF, from Riad et al. 2021 JASAβ13Aug 21, 2021Updated 4 years ago
- β26Updated this week
- C++ version of pyannote audio overlapped speech detection pipelineβ13Feb 14, 2024Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.β93Oct 18, 2023Updated 2 years ago
- Implementation of vocoders empowered with pytorch lightningβ18Jan 27, 2024Updated 2 years ago
- β10Oct 16, 2025Updated 3 months ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"β11Dec 15, 2022Updated 3 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.β12Mar 15, 2025Updated 11 months ago
- Onset-and-Offset-Aware Sound Event Detectionβ20Feb 10, 2025Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. β¦β13Dec 4, 2024Updated last year
- β32Nov 18, 2025Updated 2 months ago
- β11Nov 7, 2024Updated last year
- eSNN - Learning similarity measure from dataβ12Nov 28, 2019Updated 6 years ago
- Testing sets for semanticVADβ20Feb 18, 2025Updated 11 months ago
- Discriminative Training of VBx Diarizationβ27Sep 23, 2024Updated last year
- β11Mar 22, 2023Updated 2 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Ratesβ12Mar 13, 2024Updated last year
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequβ¦β28Sep 20, 2025Updated 4 months ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatβ¦β33Jun 14, 2024Updated last year
- Whisper Speech Quality Assessment (WhiSQA)β16Oct 14, 2025Updated 4 months ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"β33Jan 28, 2026Updated 2 weeks ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Unitsβ18Oct 2, 2024Updated last year
- β17Apr 28, 2021Updated 4 years ago
- β13Mar 11, 2025Updated 11 months ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".β22Jun 10, 2024Updated last year
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)β31May 14, 2024Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speeβ¦β17Sep 19, 2023Updated 2 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPAβ18Aug 16, 2024Updated last year
- FNSE-SBGAN: Far-field Speech Enhancement with SchrΓΆdinger Bridge and Generative Adversarial Networksβ17May 12, 2025Updated 9 months ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"β15Nov 14, 2023Updated 2 years ago
- Implementation of the paper "Can Large Language Models Predict Audio Effects Parameters from Natural Language?"β26May 27, 2025Updated 8 months ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSPβ¦β61Oct 7, 2020Updated 5 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementβ16Sep 13, 2024Updated last year
- Voice conversion training with 109 speakers with limited training samplesβ35Dec 21, 2020Updated 5 years ago
- Simple Python package for fast DER computationβ35Jun 29, 2023Updated 2 years ago
- β68Feb 15, 2021Updated 5 years ago