facebookresearch / facestar
Facestar dataset. High quality audio-visual recordings of human conversational speech.
☆99Updated 2 years ago
Related projects: ⓘ
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆53Updated last year
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆78Updated 2 years ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆38Updated last year
- ☆45Updated last year
- Transcribing Speech with Multinomial Diffusion, training code and models.☆74Updated 11 months ago
- LPC Utility for Pytorch Library.☆43Updated last month
- Training code and trained checkpoints for ASGAN.☆60Updated 8 months ago
- Official implementation of SpeechSplit2☆126Updated last year
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆56Updated last year
- N/A☆163Updated 2 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆159Updated 5 months ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆69Updated 3 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆36Updated 2 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆108Updated last year
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- ☆15Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated last year
- Demo audio of VARA-TTS model☆20Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 2 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆54Updated 6 months ago
- ☆23Updated last month
- ☆56Updated last year
- ☆25Updated 5 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆66Updated 3 years ago
- Official release of StyleTalk dataset.☆53Updated 2 months ago
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.☆80Updated 11 months ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 3 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆30Updated 5 years ago