Sreyan88 / LAPEView external linksLinks
A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)
☆30Jul 9, 2024Updated last year
Alternatives and similar repositories for LAPE
Users that are interested in LAPE are comparing it to the libraries listed below
Sorting:
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 10 months ago
- ☆24Mar 29, 2025Updated 10 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Aug 29, 2024Updated last year
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.☆48Apr 18, 2025Updated 9 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 6 months ago
- ☆21Jul 15, 2024Updated last year
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆30Sep 18, 2025Updated 4 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆96Nov 20, 2024Updated last year
- Russian phonetical transcription☆11Nov 19, 2025Updated 2 months ago
- ☆108Oct 1, 2024Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆41Mar 3, 2025Updated 11 months ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730☆131Dec 8, 2023Updated 2 years ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆46Nov 19, 2024Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Mar 10, 2024Updated last year
- Streaming Vocos☆29Jun 10, 2025Updated 8 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆13Nov 22, 2022Updated 3 years ago
- REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR☆14Dec 11, 2024Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 4 months ago
- An implementation of Neural Style Transfer for Audio using Pytorch.☆10Dec 14, 2017Updated 8 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- ☆44Sep 19, 2024Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ☆11May 7, 2022Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- ☆14Nov 26, 2024Updated last year
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- ☆15Nov 11, 2024Updated last year
- ☆23Dec 6, 2025Updated 2 months ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago