cmpute / audio-codec-benchmark
Comprehensive quantitative comparison of lossless and lossy audio codecs
☆34Updated 2 years ago
Alternatives and similar repositories for audio-codec-benchmark:
Users that are interested in audio-codec-benchmark are comparing it to the libraries listed below
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 6 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 6 months ago
- Streaming Vocos☆19Updated last month
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆42Updated 4 months ago
- source code of EfficientTTS 2☆12Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- A spoken version of the textual story cloze benchmark☆14Updated last year
- 60k hours of phoneme-aligned audio from audio books☆18Updated 6 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- Production-ready vocoder using BigVSAN☆11Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- ☆10Updated 3 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆15Updated 3 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 7 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 5 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆29Updated 6 months ago
- ☆19Updated 10 months ago
- Official implementation of Self-Remixing☆13Updated last year
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆15Updated 6 months ago
- Prosodic Speech Segmentation with Transformers☆25Updated 11 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆17Updated 10 months ago
- Digital Speech Processing in PyTorch.☆14Updated 2 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Updated last year
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago
- iSeparate library for the SDX2023 challenge☆13Updated last year
- Reimplementation of Miipher☆20Updated last year
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆52Updated 2 years ago