cmpute / audio-codec-benchmark
Comprehensive quantitative comparison of lossless and lossy audio codecs
☆36Updated 2 years ago
Alternatives and similar repositories for audio-codec-benchmark:
Users that are interested in audio-codec-benchmark are comparing it to the libraries listed below
- ☆41Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆18Updated 2 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆20Updated last year
- Just another FastSpeech 2 but cleaner code :)☆26Updated 9 months ago
- Speech Resynthesis and Language Modeling Using Flow Matching and Llama☆17Updated this week
- Paper, Code and Statistics for Speech Generatation.☆10Updated 2 years ago
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆13Updated 2 years ago
- ☆16Updated 7 months ago
- ☆13Updated last year
- Reimplementation of Miipher☆20Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- ☆13Updated 6 months ago
- ☆13Updated 7 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 9 months ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆14Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 2 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆33Updated last year
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆22Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆15Updated 4 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 6 months ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Updated last year
- Alignment examples for Interspeech 2024☆20Updated 9 months ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Updated 2 years ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆28Updated last year
- ☆17Updated 3 years ago
- ☆23Updated 2 years ago
- End-to-End SpeechSynthesis system with fastspeech2 & hifigan☆13Updated 2 years ago