srvk / how2-datasetLinks
This repository contains code and metadata of How2 dataset
☆181Updated 8 months ago
Alternatives and similar repositories for how2-dataset
Users that are interested in how2-dataset are comparing it to the libraries listed below
Sorting:
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆55Updated 3 years ago
- ☆53Updated 5 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆52Updated 3 years ago
- Multi-modal Neural Machine Translation in PyTorch☆44Updated 7 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆100Updated 2 years ago
- ☆53Updated 3 years ago
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.☆53Updated 2 years ago
- Implementation of meta-transfer-learning for ASR and LM (ACL 2020)☆50Updated 5 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 3 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆27Updated 3 years ago
- Speech2vec pre-trained word vectors☆76Updated 6 years ago
- Zero -- A neural machine translation system☆153Updated 2 years ago
- ☆27Updated 5 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆143Updated 3 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆306Updated 2 years ago
- Multilingual speech translation☆41Updated 4 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆64Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆19Updated 8 months ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆18Updated 3 years ago
- ☆179Updated 3 years ago
- Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch☆27Updated 2 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆48Updated 3 years ago
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".☆21Updated last year
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆36Updated last year
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset☆57Updated 4 years ago
- A curated list of AWESOME papers, datasets and tutorials within Multimodal Machine Translation.☆36Updated 3 years ago
- ☆15Updated 4 years ago
- Cross-lingual Visual Pre-training for Multimodal Machine Translation☆18Updated 3 years ago
- This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…☆43Updated 2 years ago
- Code and Data for the ACL22 main conference paper "MSCTD: A Multimodal Sentiment Chat Translation Dataset"☆41Updated 8 months ago