facebookresearch / ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
☆123Updated 2 months ago
Related projects: ⓘ
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆151Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆66Updated 3 weeks ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆54Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆81Updated last month
- Evaluation and Benchmarking of Speech Super-resolution Methods☆133Updated 2 years ago
- Pytorch implementation of subband decomposition☆88Updated 2 years ago
- UTokyo-SaruLab MOS Prediction System☆49Updated this week
- Reference-aware automatic speech evaluation toolkit☆95Updated 6 months ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆112Updated 3 weeks ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆135Updated last year
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio☆138Updated last year
- ☆63Updated last year
- Yin pitch estimator in PyTorch☆113Updated last year
- ☆72Updated last year
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆70Updated 2 months ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆108Updated 3 months ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆115Updated 3 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆42Updated 2 months ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆73Updated last year
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆26Updated last month
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆91Updated last year
- ☆92Updated this week
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆40Updated this week
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆187Updated last year
- ☆50Updated last year
- The open source code for SimpleSpeech series☆85Updated last month
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆63Updated 3 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆24Updated 3 months ago
- UT-Sarulab MOS prediction system using SSL models☆163Updated 5 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆54Updated 3 weeks ago