The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11May 16, 2023Updated 2 years ago
Alternatives and similar repositories for SOV-MAS
Users that are interested in SOV-MAS are comparing it to the libraries listed below
Sorting:
- Code for ACL 2023 paper: Exploring Better Text Image Translation with Multimodal Codebook☆21May 12, 2025Updated 9 months ago
- Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos☆12Oct 8, 2020Updated 5 years ago
- Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos☆10Sep 2, 2024Updated last year
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"☆12Nov 7, 2023Updated 2 years ago
- Implementation of DTMT with incremental decoding☆13Feb 20, 2021Updated 5 years ago
- "Can images help recognize entities? A study of the role of images for Multimodal NER" (W-NUT at EMNLP 2021)☆21Nov 14, 2021Updated 4 years ago
- Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"☆18Sep 5, 2022Updated 3 years ago
- ESPER☆24Mar 29, 2024Updated last year
- ☆31Apr 21, 2023Updated 2 years ago
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization☆36Jan 13, 2024Updated 2 years ago
- 吴恩达《机器学习》课后习题 Python 版 These are Exercises for Coursera's MachineLearning (by Andrew Ng) by Python.☆11Oct 26, 2018Updated 7 years ago
- cross modal background suppression for audio-visual event localization☆36Mar 18, 2022Updated 3 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Nov 4, 2022Updated 3 years ago
- ☆10May 1, 2025Updated 10 months ago
- autoHeightTextarea自适应高度的textarea是一款jquery插件,支持链式调用,支持设置最小行数、最小高度、最大行数和最大高度,在输入文字的时候实现textarea的高度自适应。http://www.fxss5201.cn/project/html/t…☆13Dec 18, 2018Updated 7 years ago
- Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.☆11Feb 12, 2026Updated 2 weeks ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Mar 8, 2023Updated 2 years ago
- Enhancing Domain Adaptation through Prompt Gradient Alignment (NeurIPS 2024)☆14Jun 16, 2024Updated last year
- Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification☆10May 30, 2024Updated last year
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Jul 30, 2021Updated 4 years ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Oct 6, 2023Updated 2 years ago
- ☆10Jul 16, 2024Updated last year
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…☆11Jul 24, 2024Updated last year
- This is an official implementation in PyTorch of PTH-Net: Dynamic Facial Expression Recognition without Face Detection and Alignment..☆13Jul 1, 2025Updated 8 months ago
- ☆10Oct 16, 2025Updated 4 months ago
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering☆11Mar 10, 2023Updated 2 years ago
- Implementation of Variational Hierarchical User-based Conversation Model☆10Jul 2, 2021Updated 4 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year
- [NeurIPS 2025] Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling☆24Dec 16, 2025Updated 2 months ago
- Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking☆10Feb 29, 2024Updated 2 years ago
- ☆10Jan 18, 2024Updated 2 years ago
- Logo detection in images using SSD☆10Jul 13, 2018Updated 7 years ago
- Visual Question Generation☆11Aug 20, 2024Updated last year
- The official repository for the paper entitled "Time Travel in LLMs: Tracing Data Contamination in Large Language Models."☆12Jun 11, 2024Updated last year
- 中国科学院大学,601高等数学甲,历年考研真题收集整理☆11Aug 4, 2025Updated 6 months ago
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, …☆11Nov 24, 2024Updated last year
- 2021 QQ浏览器ai算法大赛 赛道一 决赛第17名☆17Oct 25, 2022Updated 3 years ago