r10a / music-speech-classifier
Aims to implement a classifier that labels an audio sample as either speech or music.
☆10Updated 6 years ago
Alternatives and similar repositories for music-speech-classifier
Users who are interested in music-speech-classifier are comparing it to the libraries listed below.
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆99Updated 2 years ago
- ☆61Updated 2 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆63Updated 5 months ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 5 years ago
- An unofficial implementation of DeepVQE proposed by Microsoft Corp.☆123Updated 10 months ago
- ☆25Updated 2 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆158Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Updated 3 years ago
- ☆119Updated 2 years ago
- ☆24Updated last year
- This GitHub repo is for NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆106Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆55Updated 7 months ago
- Collection of speech algorithm resources (Resource for Speech Processing) || NEWS: the official VoxCeleb link has been failing recently, so an external download link has been added☆55Updated 3 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆87Updated 8 months ago
- ☆41Updated last year
- ☆31Updated 3 years ago
- Official Repository for "SingFake: Singing Voice Deepfake Detection"☆63Updated last year
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆45Updated 3 years ago
- ☆62Updated last year
- ☆66Updated 2 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆55Updated 9 months ago
- ☆18Updated last year
- PyTorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆101Updated 4 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆77Updated last year
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆129Updated 2 years ago
- Speech Separation☆78Updated last year
- ☆35Updated last year
- ☆15Updated 2 years ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆45Updated 2 months ago