EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆12Nov 7, 2023Updated 2 years ago
Alternatives and similar repositories for MLASK
Users who are interested in MLASK are comparing it to the repositories listed below
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆37Jan 29, 2025Updated last year
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Official code and dataset link for "VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles"☆36Jul 30, 2021Updated 4 years ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆86Apr 24, 2023Updated 2 years ago
- Multimodal summarization of user-generated videos from wearable cameras☆23Jun 22, 2025Updated 9 months ago
- "Can images help recognize entities? A study of the role of images for Multimodal NER" (W-NUT at EMNLP 2021)☆21Nov 14, 2021Updated 4 years ago
- Toward Practical Entity Alignment Method Design: Insights from New Highly Heterogeneous Knowledge Graph Datasets☆17Feb 18, 2025Updated last year
- An application that collects Steam reviews☆15Sep 4, 2022Updated 3 years ago
- The largest open-source Chinese Q&A dataset, supporting Chinese LLMs☆10Jul 31, 2023Updated 2 years ago
- Code for the paper "Automated Generation of Hospital Discharge Summaries Using Clinical Guidelines and Large Language Models"☆11May 3, 2024Updated last year
- Online Collaborative Topic Regression☆14Mar 11, 2018Updated 8 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆57Jan 14, 2022Updated 4 years ago
- This repo contains the Pytorch implementation of the AAAI'18 paper - Deep Reinforcement Learning for Unsupervised Video Summarization wit…☆11Jun 5, 2023Updated 2 years ago
- A Tor Browser crawler based on selenium and phantomjs, used for work on Website Fingerprinting (WFP) Attacks.☆11Aug 25, 2017Updated 8 years ago
- Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos☆12Oct 8, 2020Updated 5 years ago
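- Training and tuning of fastText Chinese word vectors, with weighted fusion of character vectors and word vectors to address over-representation of literal surface forms rather than semantics☆11Aug 3, 2020Updated 5 years ago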
- Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos☆10Sep 2, 2024Updated last year
- [MICCAI'22] Unsupervised Contrastive Learning on Gall Bladder Ultrasound Videos☆11May 28, 2023Updated 2 years ago
- A simple DBMS, a course project for Introduction to Databases☆14Nov 30, 2018Updated 7 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- PyTorch implementation of Multimodal Neural Machine Translation (MNMT).☆12Jan 21, 2021Updated 5 years ago
- 🏓 A Ping Pong game written in VHDL with VGA support☆14Jun 2, 2019Updated 6 years ago
- Create plastic trash aerial image dataset - HAIDA☆15Jul 24, 2021Updated 4 years ago
- Implementation of "Multi-modal Retrieval Augmented Multi-modal Generation: Datasets, Evaluation Metrics and Strong Baselines"☆31Feb 24, 2025Updated last year
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Dec 8, 2022Updated 3 years ago
- Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…☆17Jul 7, 2024Updated last year
- I fine-tuned (p-tuning) Tsinghua’s open-source large language model, ChatGLM2-6B, using several years of my WeChat chat history. Inspired…☆12Mar 6, 2024Updated 2 years ago
- Code and models for MICCAI23 paper: "Self-Supervised Learning for Endoscopy Video Analysis".☆22Oct 2, 2023Updated 2 years ago
- Data set for the IEEE TGRS paper "Mutual Attention Inception Network for Remote Sensing Visual Question Answering"☆22Nov 14, 2022Updated 3 years ago
- Implementation of LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN☆22Jul 11, 2023Updated 2 years ago
- Training a computer to write music☆20Oct 5, 2017Updated 8 years ago