midas-research / DECALinks

Data Extension and Class Addition for VSR

☆8

Alternatives and similar repositories for DECA

Users that are interested in DECA are comparing it to the libraries listed below

Sorting:

shayangharib / AUDASC
Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification
☆35Updated 6 years ago
wnhsu / ScalableFHVAE
This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders…
☆53Updated 7 years ago
shaojinding / GroupLatentEmbedding
Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…
☆28Updated 5 years ago
matthijsvk / multimodalSR
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
☆69Updated 2 years ago
kastnerkyle / representation_mixing
Demos, pretrained models, and (WIP) code supporting Representation Mixing
☆51Updated 6 years ago
JeremyCCHsu / vc-vawgan
Network specification and demo
☆35Updated 8 years ago
bobchennan / sparse_image_warp_pytorch
Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…
☆23Updated 5 years ago
ankitshah009 / WALNet-Weak_Label_Analysis
Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.
☆32Updated last year
espnet / icassp2020-tts
ESPnet-TTS Audio Sample HP
☆21Updated 5 years ago
ondrejklejch / learning_to_adapt
Coordinate-wise meta-learner for speaker adaptation of ASR models.
☆20Updated 5 years ago
Kajiyu / LLLNet
Keras Implementation of "Look, Listen and Learn" Model
☆21Updated 7 years ago
wnhsu / SpeechVAE
This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and…
☆52Updated 7 years ago
swshon / dialectID_siam
Dialect identification using Siamese network
☆15Updated 7 years ago
tstafylakis / Lipreading-ResNet
Torch code for using Residual Networks with LSTMs for Lipreading
☆98Updated 6 years ago
kefirski / pytorch_TDNN
Time Delayed NN implemented in pytorch
☆81Updated 8 years ago
ajinkyaT / Lip_Reading_in_the_Wild_AVSR
Audio-Visual Speech Recognition using Deep Learning
☆60Updated 6 years ago
mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…
☆38Updated 7 years ago
jaywalnut310 / MelGAN-Pytorch
A Pytorch Implementation of MelGAN
☆67Updated 5 years ago
EIHW / Attention-based_Atrous_CNN
Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…
☆14Updated 4 years ago
dhgrs / pytorch-UniWaveNet
☆31Updated 6 years ago
distsup / DistSup
Representation learning for NLP @ JSALT19
☆39Updated 4 years ago
karolpiczak / paper-2017-DCASE
The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data
☆39Updated 7 years ago
DeepLearn-lab / Acoustic-Feature-Fusion_Chime18
Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow
☆26Updated 6 years ago
HaiFengZeng / clari_wavenet_vocoder
☆56Updated 6 years ago
anuragkr90 / weak_feature_extractor
☆59Updated 7 years ago
a-nagrani / SVHF-Net
SVHF-Net for Cross-modal binary matching
☆32Updated 6 years ago
artbataev / end2end
Losses and decoders for end-to-end ASR and OCR
☆34Updated 4 years ago
AppleHolic / PytorchSR
Pytorch based phoneme recognition (TIMIT phoneme classification)
☆34Updated 7 years ago
dr-costas / SEDLM
Language modelling for sound event detection
☆20Updated 5 years ago
marc-moreaux / audioset_raw
Download and create a tfreader for the audioset dataset
☆16Updated 5 years ago