dhimasryan / TMHINT-QI-VoiceMOS2023Links
☆17Updated 2 years ago
Alternatives and similar repositories for TMHINT-QI-VoiceMOS2023
Users that are interested in TMHINT-QI-VoiceMOS2023 are comparing it to the libraries listed below
Sorting:
- ☆59Updated last year
- ☆32Updated 11 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆33Updated 11 months ago
- ☆27Updated 2 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆41Updated last year
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated 2 years ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆15Updated 11 months ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Updated last month
- ☆27Updated 2 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆22Updated last year
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆15Updated 6 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆55Updated 2 years ago
- ☆55Updated 11 months ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆42Updated 2 years ago
- A benchmark for evaluating audio encoders on various audio tasks.☆29Updated 3 weeks ago
- Official PyTorch implementation of 'Rec-RIR: Monaural Blind Room Impulse Response Identification via DNN-based Reverberant Speech Reconst…☆23Updated 2 weeks ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆35Updated 3 weeks ago
- ☆31Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 3 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Updated 4 years ago
- ☆31Updated 2 years ago
- ☆25Updated last year
- ☆66Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 9 months ago
- Pytorch implementation of subband decomposition☆92Updated 3 years ago