xinshengwang / S2IGAN
Pytorch Code for S2IGAN
☆41Updated 4 years ago
Alternatives and similar repositories for S2IGAN
Users who are interested in S2IGAN are comparing it to the repositories listed below.
- Speech-conditioned face generation using Generative Adversarial Networks☆88Updated 2 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago
- Implementation of Differential Learning Rate in Keras☆11Updated 6 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆121Updated 2 years ago
- Utils and data sets for audio and PyTorch☆85Updated 3 years ago
- Implementation of Multistream Transformers in Pytorch☆54Updated 3 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago
- LipNet with gluon☆23Updated 2 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆56Updated 3 years ago
- Two-stage GANs that generate fingerstyle guitarist images from audio.☆59Updated 6 years ago
- Source code for "Towards a Deeper Understanding of Adversarial Losses under a Discriminative Adversarial Network Setting"☆42Updated 2 years ago
- Contrastive Language-Audio Pretraining☆87Updated 3 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 3 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆59Updated 4 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- bumble bee transformer☆14Updated 4 years ago
- Code for paper "direct speech-to-image translation"☆27Updated 5 years ago
- Implementations of various GAN architectures using PyTorch Lightning☆26Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 4 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆72Updated 6 years ago