r10a / music-speech-classifierLinks
Aim to implement a classifier which classifies an audio sample into speech or music.
☆10Updated 6 years ago
Alternatives and similar repositories for music-speech-classifier
Users that are interested in music-speech-classifier are comparing it to the libraries listed below
Sorting:
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆98Updated 2 years ago
- An unofficial implementation of DeepVQE proposed by Microsoft Corp.☆117Updated 9 months ago
- ☆31Updated 3 years ago
- ☆119Updated 2 years ago
- speech enhancement\speech seperation\sound source localization☆15Updated 5 years ago
- Speech Separation☆78Updated last year
- ☆39Updated last year
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆104Updated 2 years ago
- ☆60Updated 2 years ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆127Updated 2 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆52Updated 8 months ago
- This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.☆89Updated 3 months ago
- Source code for Consistent ensemble distillation for audio tagging☆54Updated 6 months ago
- ☆25Updated last year
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆123Updated 3 years ago
- ☆15Updated 2 years ago
- STOI loss function in PyTorch☆101Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆79Updated 7 months ago
- multi-scale time domain speaker extraction☆70Updated 4 years ago
- 语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download☆55Updated 3 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆146Updated 7 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆158Updated 3 years ago
- Official repository for FlowSE (Interspeech 2025)☆82Updated 5 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆76Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆87Updated 7 months ago
- ☆61Updated last year
- A simple package for Guided source separation (GSS)☆132Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆44Updated last month
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆92Updated 6 years ago
- AnyEnhance-based Baseline for the CCF-AATC 2025 Challenge Track 1☆40Updated last month