r10a / music-speech-classifierLinks
Aim to implement a classifier which classifies an audio sample into speech or music.
☆9Updated 5 years ago
Alternatives and similar repositories for music-speech-classifier
Users that are interested in music-speech-classifier are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 4 years ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Objective measures of speech quality SNR☆18Updated 5 years ago
- US-based professors who work on audio. For students who would like to apply for RA, PhD, postdoc in audio research.☆26Updated 2 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- ☆29Updated 2 years ago
- An unofficial implementation of DeepVQE proposed by Microsoft Corp.☆88Updated 2 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆64Updated 10 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆19Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆77Updated 2 weeks ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Updated 8 months ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- ☆65Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆86Updated 2 years ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆23Updated 2 years ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 7 months ago
- Implementation of SpatialCodec.☆58Updated last year
- ☆69Updated 4 years ago
- PyTorch implementation of LiMuSE☆31Updated 2 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆46Updated last month
- ☆10Updated 2 years ago
- ☆81Updated 11 months ago
- Query-conditioned target sound extraction model☆23Updated 2 months ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆25Updated 4 years ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆68Updated last year
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- The official PyTorch implementation of paper: An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmen…☆9Updated 3 years ago
- ☆56Updated last year
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆32Updated last year