xiuwenz2 / SAPC-template
☆14Feb 8, 2026Updated last week
Alternatives and similar repositories for SAPC-template
Users that are interested in SAPC-template are comparing it to the libraries listed below
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Oct 14, 2025Updated 4 months ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Reference-aware automatic speech evaluation toolkit☆178Dec 5, 2024Updated last year
- Diffusion Model for Voice Conversion☆69Mar 14, 2024Updated last year
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆32Mar 14, 2025Updated 11 months ago
- This is a general framework for fake audio detection using pytorch lightning☆27Jul 24, 2025Updated 6 months ago
- OpenFLAM: Framewise Language Audio Model☆88Jan 14, 2026Updated last month
- Machine learning speaker characteristics☆43Updated this week
- ☆11Sep 4, 2023Updated 2 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated last year
- ☆37Jul 4, 2024Updated last year
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆12Nov 28, 2024Updated last year
- ☆16Jun 12, 2025Updated 8 months ago
- An interpreter in C for the language brainfuck.☆10Apr 12, 2023Updated 2 years ago
- ☆15Sep 16, 2024Updated last year
- ☆10Sep 2, 2024Updated last year
- Accompanying repository for the paper "Automatic Music Mixing Using a Generative Model of Effect Embeddings"☆21Jan 18, 2026Updated 3 weeks ago
- ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos☆15Aug 17, 2023Updated 2 years ago
- Latex template for CUHK PhD Thesis☆11Jun 29, 2025Updated 7 months ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- ☆11Nov 7, 2024Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 3 months ago
- VoxAngeles Corpus☆13Aug 23, 2025Updated 5 months ago
- ☆32Nov 18, 2025Updated 2 months ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 8 months ago
- Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…☆11Feb 14, 2024Updated 2 years ago
- ☆16Sep 19, 2023Updated 2 years ago
- Overlap-and-add convolution in Python aimed at applying reverberation in music and audio signals☆12Sep 12, 2017Updated 8 years ago
- ☆16May 2, 2025Updated 9 months ago
- ☆11Mar 22, 2023Updated 2 years ago
- EMO-SUPERB submission☆50Oct 13, 2025Updated 4 months ago
- This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets re…☆13Oct 8, 2025Updated 4 months ago
- ☆15Nov 10, 2025Updated 3 months ago
- Implementation of ConflictNET: End-to-End Learning for Speech-Based Conflict Intensity Estimation - IEEE Signal Processing Letters☆13Dec 8, 2022Updated 3 years ago
- Sound field reconstruction using neural processes with dynamic kernels☆15Mar 25, 2025Updated 10 months ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆12Oct 31, 2024Updated last year
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆16Oct 6, 2025Updated 4 months ago