srvk / how2-dataset
This repository contains code and metadata of How2 dataset
☆160Updated last month
Related projects: ⓘ
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆98Updated last year
- Zero -- A neural machine translation system☆148Updated last year
- Multilingual speech translation☆41Updated 3 years ago
- Speech2vec pre-trained word vectors☆77Updated 6 years ago
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.☆50Updated last year
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆305Updated last year
- ☆51Updated 2 years ago
- ☆40Updated this week
- Implementation of meta-transfer-learning for ASR and LM (ACL 2020)☆49Updated 4 years ago
- ☆10Updated this week
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆54Updated 2 years ago
- ☆53Updated 4 years ago
- This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation'. Accepted…☆13Updated 11 months ago
- Multi-modal Neural Machine Translation in PyTorch☆44Updated 6 years ago
- Code, Models and Datasets for OpenViDial Dataset☆131Updated 2 years ago
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆134Updated 4 months ago
- Tracking the progress in end-to-end speech translation☆249Updated 10 months ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆98Updated last week
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆49Updated 2 years ago
- Code base for the paper "Latent variable model for multi-modal translation".☆16Updated last month
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago
- ☆174Updated 2 years ago
- ☆26Updated 2 years ago
- A curated list of AWESOME papers, datasets and tutorials within Multimodal Machine Translation.☆35Updated 3 years ago
- Temporal Reasoning via Audio Question Answering☆20Updated 4 years ago
- Cross-lingual Visual Pre-training for Multimodal Machine Translation☆18Updated 2 years ago
- Code and Data for the ACL22 main conference paper "MSCTD: A Multimodal Sentiment Chat Translation Dataset"☆40Updated last year
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58Updated 4 years ago
- ☆22Updated 3 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆46Updated 2 years ago