pawel-kaczmarek / The-A-Files
Implementations of audio watermarking methods, speech quality metrics and attacks in different domains.
☆16Updated last week
Related projects ⓘ
Alternatives and complementary repositories for The-A-Files
- This repository contains official pytorch implementation and pre-trained models for the MR-RawNet.☆10Updated 4 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆16Updated this week
- Algorithm for blind estimation of reverberation time☆15Updated 5 months ago
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Updated last year
- ☆12Updated 8 months ago
- Viterbi decoding in PyTorch☆26Updated last month
- ☆20Updated 3 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆21Updated last month
- ☆12Updated 3 months ago
- ESLTTS dataset☆16Updated 4 months ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆21Updated last month
- ☆10Updated 2 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 3 months ago
- ☆13Updated 9 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆16Updated last year
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆18Updated 10 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆30Updated 3 months ago
- HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.☆9Updated 9 months ago
- ☆13Updated last month
- Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.☆26Updated this week
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆10Updated 10 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆13Updated last month
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆33Updated 2 years ago
- Source code for DM-Codec.☆18Updated 3 weeks ago
- ☆20Updated last year
- ☆20Updated 10 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆44Updated this week
- ☆14Updated last month
- Crowdsourced and Automatic Speech Prominence Estimation☆13Updated 6 months ago
- ☆18Updated 2 months ago