madhavlab / 2022_syncnetLinks
SyncNet for Time Synchronization
☆25Updated 2 years ago
Alternatives and similar repositories for 2022_syncnet
Users that are interested in 2022_syncnet are comparing it to the libraries listed below
Sorting:
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Updated 3 years ago
- PyTorch implementation of LiMuSE☆31Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 2 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆74Updated 2 weeks ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆46Updated last month
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆41Updated 6 months ago
- Dual-Path Attention and Recurrent Network for speech separation☆16Updated 8 months ago
- Streaming Audiotransformers for online Audio tagging☆44Updated 11 months ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- ☆20Updated 7 months ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32Updated 6 years ago
- singing voice conversion without f0☆23Updated 2 years ago
- The official PyTorch implementation of paper: An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmen…☆9Updated 3 years ago
- ☆62Updated last year
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆59Updated 2 weeks ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆38Updated 3 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 9 months ago
- (TASLP 2022) Unsupervised speech enhancement using DVAEs☆21Updated 5 months ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated 2 months ago
- Pytorch Models for Speech Enhancement☆20Updated 2 years ago
- ☆13Updated last year
- ☆12Updated 10 months ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆54Updated last year
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆32Updated last year
- a compact audio-to-phoneme aligner for singing voice☆10Updated last year
- Implementation of Emo-StarGAN☆45Updated last year
- ☆26Updated 3 years ago