midas-research / DECALinks
Data Extension and Class Addition for VSR
☆8Updated 4 years ago
Alternatives and similar repositories for DECA
Users that are interested in DECA are comparing it to the libraries listed below
Sorting:
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆35Updated 6 years ago
- This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders…☆53Updated 7 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- Network specification and demo☆35Updated 8 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Updated 5 years ago
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 7 years ago
- This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and…☆52Updated 7 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Torch code for using Residual Networks with LSTMs for Lipreading☆98Updated 6 years ago
- Time Delayed NN implemented in pytorch☆81Updated 8 years ago
- Audio-Visual Speech Recognition using Deep Learning☆60Updated 6 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- A Pytorch Implementation of MelGAN☆67Updated 5 years ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Updated 4 years ago
- ☆31Updated 6 years ago
- Representation learning for NLP @ JSALT19☆39Updated 4 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Updated 7 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 6 years ago
- ☆56Updated 6 years ago
- ☆59Updated 7 years ago
- SVHF-Net for Cross-modal binary matching☆32Updated 6 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Updated 4 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆34Updated 7 years ago
- Language modelling for sound event detection☆20Updated 5 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago