klauscc / lipnet-replicationLinks
A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading
☆28Updated 7 years ago
Alternatives and similar repositories for lipnet-replication
Users that are interested in lipnet-replication are comparing it to the libraries listed below
Sorting:
- Representations of language in a model of visually grounded speech signal.☆23Updated 7 years ago
- Attention Bidirectional Video Recurrent Net☆56Updated 6 years ago
- Attempts to understand deep learning and the Tensorflow RNN api by implementing a (very)crude version of the google DeViSE paper(2013).☆7Updated 8 years ago
- AENet: audio feature extraction☆60Updated 5 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆43Updated 7 years ago
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset☆12Updated 6 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 6 years ago
- https://arxiv.org/abs/1707.00836☆21Updated 7 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 6 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON☆18Updated 4 years ago
- code for triplet GAN☆31Updated 7 years ago
- Automatic Query Image Disambiguation (AID)☆11Updated 7 years ago
- ☆29Updated 8 years ago
- Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Parsing"☆12Updated 5 years ago
- We propose a new variant GAN model to deal with image generation and transformation,especially in facial attributes area.☆12Updated 7 years ago
- Stochastic Adaptive Neural Architecture Search☆65Updated 6 years ago
- a list of recent papers on transfer learning☆24Updated 7 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Updated 9 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 8 years ago
- Scripts to extract CNN features from video frames with Keras.☆24Updated 8 years ago
- ☆48Updated last year
- Pytorch implementation of pixel level domain transfer☆9Updated 7 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Updated 2 years ago
- Fast-Slow Recurrent Neural Networks☆14Updated 7 years ago
- ☆11Updated 7 years ago
- Modular and Simple approach to VQA in Keras☆21Updated 7 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆19Updated 9 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆34Updated 4 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Updated 7 years ago