yochaiye / LipVoicer
Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"
☆56Updated 4 months ago
Alternatives and similar repositories for LipVoicer:
Users that are interested in LipVoicer are comparing it to the libraries listed below
- ☆66Updated last week
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆98Updated 2 weeks ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆65Updated 3 months ago
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units☆27Updated 3 months ago
- The official implementation of EmoSphere++☆70Updated last week
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆140Updated 10 months ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆53Updated 7 months ago
- ☆65Updated last year
- [Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation☆105Updated last week
- Zero-Shot Emotion Style Transfer☆41Updated 9 months ago
- Official implementation of USR (NeurIPS 2024)☆28Updated last month
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆49Updated 3 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆79Updated 10 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- ☆56Updated 3 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆67Updated last month
- The official implementation of EmoSphere-TTS☆105Updated last week
- Diffusion Model for Voice Conversion☆46Updated 10 months ago
- ☆46Updated last year
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆48Updated 7 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆126Updated 4 months ago
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization☆40Updated last month
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆210Updated 6 months ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆145Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Updated last year
- Implementation of Emo-StarGAN☆46Updated last year
- Official implementation of SpeechSplit2☆130Updated 2 years ago
- ☆46Updated 2 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆47Updated last year