facebookresearch / BinauralSpeechSynthesisLinks
N/A
☆174Updated 3 years ago
Alternatives and similar repositories for BinauralSpeechSynthesis
Users that are interested in BinauralSpeechSynthesis are comparing it to the libraries listed below
Sorting:
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆157Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆68Updated 2 years ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- ☆64Updated last year
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆188Updated 2 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆107Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆149Updated 3 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆64Updated 11 months ago
- ☆59Updated 4 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆165Updated last year
- A repository for benchmarking neural vocoders by their quality and speed.☆210Updated 3 weeks ago
- STOI loss function in PyTorch☆91Updated 8 months ago
- Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆61Updated 5 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆81Updated 3 years ago
- A pytroch implementation of the FB-MelGAN☆89Updated 5 years ago
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆95Updated 4 months ago
- Official implementation of SpeechSplit2☆132Updated 2 years ago
- ☆99Updated 3 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Updated 4 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆106Updated 3 years ago
- ☆69Updated 4 years ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆69Updated 2 years ago
- The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.☆126Updated 3 years ago
- ☆65Updated last year
- The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmente…☆119Updated last year
- ☆87Updated 2 years ago
- An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Updated 4 years ago
- The official source code of UniAudio☆95Updated last year