EricLee8 / MPD_EMVI
Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference
☆10 Updated 2 years ago
Alternatives and similar repositories for MPD_EMVI
Users who are interested in MPD_EMVI are comparing it to the repositories listed below.
- Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment". ☆14 Updated 2 years ago
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation". ☆36 Updated last year
- ☆183 Updated last year
- EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Vi… ☆53 Updated 3 months ago
- Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations". ☆21 Updated last year
- Official implementation of AAAI 2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement" ☆18 Updated 3 years ago
- This code repository is for the accepted ACL 2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and… ☆43 Updated 2 years ago
- Code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022) ☆64 Updated 3 years ago
- The official site of paper MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation ☆198 Updated last year
- ☆22 Updated 4 years ago
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation" ☆18 Updated 10 months ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021 ☆48 Updated 3 years ago
- Official implementation for the ICML 2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation" ☆127 Updated last year
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation ☆20 Updated 9 months ago
- A benchmark for the task of translation suggestion ☆60 Updated 3 years ago
- Implementation of our ACL 2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua… ☆19 Updated 2 years ago
- ☆59 Updated 2 years ago
- Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval… ☆21 Updated 2 years ago
- [ICASSP 2024] Code for paper "SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection" ☆11 Updated last year
- A project for tri-modal LLM benchmarking and instruction tuning. ☆42 Updated 5 months ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022) ☆14 Updated 2 years ago
- This setup allows training end-to-end neural models for spoken language understanding (SLU). ☆11 Updated 2 years ago
- A paper list about diffusion models for natural language processing. ☆182 Updated 2 years ago
- ☆40 Updated last year
- ☆22 Updated 4 months ago
- ☆40 Updated last year
- A fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model. ☆25 Updated 4 months ago
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning ☆15 Updated last year
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation" ☆12 Updated 3 years ago
- Code for EMNLP 2023 findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation" ☆18 Updated last year