soham97 / PAM
PAM is a no-reference audio quality metric for audio generation tasks
☆49Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for PAM
- ☆47Updated last week
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆38Updated last month
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- ☆40Updated 5 months ago
- ☆55Updated 11 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆67Updated 2 weeks ago
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 3 weeks ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆84Updated 2 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆78Updated 4 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆50Updated 2 weeks ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆41Updated last week
- ☆34Updated 5 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆24Updated last month
- ☆45Updated this week
- The open source code for SimpleSpeech series☆111Updated last month
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated 9 months ago
- ☆30Updated last year
- Generation scripts for EARS-WHAM and EARS-Reverb☆23Updated 2 months ago
- Implementation of SpatialCodec.☆55Updated last year
- Query-conditioned target sound extraction model☆18Updated 3 weeks ago
- ☆21Updated 6 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆63Updated 2 months ago
- ☆48Updated last year
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆54Updated 7 months ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆33Updated 3 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- ☆62Updated 10 months ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆58Updated 7 months ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆122Updated 5 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆69Updated last month