EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆12Nov 7, 2023Updated 2 years ago
Alternatives and similar repositories for MLASK
Users that are interested in MLASK are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆37Jan 29, 2025Updated last year
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- ☆14Jun 17, 2024Updated last year
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆85Apr 24, 2023Updated 2 years ago
- "Can images help recognize entities? A study of the role of images for Multimodal NER" (W-NUT at EMNLP 2021)☆21Nov 14, 2021Updated 4 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Jul 30, 2021Updated 4 years ago
- Multimodal summarization of user-generated videos from wearable cameras☆23Jun 22, 2025Updated 8 months ago
- [ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding☆40Mar 18, 2025Updated 11 months ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Dec 8, 2022Updated 3 years ago
- Platform API Project seed☆12Nov 8, 2023Updated 2 years ago
- ☆10Jun 6, 2024Updated last year
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆22Jan 24, 2026Updated last month
- cross modal background suppression for audio-visual event localization☆36Mar 18, 2022Updated 3 years ago
- Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.☆11Feb 12, 2026Updated 2 weeks ago
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis…☆14Oct 21, 2024Updated last year
- ☆13Oct 8, 2024Updated last year
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 8 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆12Dec 3, 2024Updated last year
- Application for Agent re-engineering for better and reliable Gen AI workflows.☆10Jul 20, 2025Updated 7 months ago
- This repo contains documentation related to the operation of the OpenBytes project.☆13Oct 29, 2021Updated 4 years ago
- Evaluation of Oasis Platform - simple install, UI and API☆14Feb 9, 2026Updated 3 weeks ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- Q-HEART: ECG Question Answering via Knowledge-Informed Multimodal LLMs (ECAI 2025)☆14Jan 23, 2026Updated last month
- A relatively simple, unified method for reporting on Kubernetes resource issues.☆12Mar 5, 2020Updated 5 years ago
- ☆14Dec 3, 2025Updated 2 months ago
- ☆11Oct 24, 2022Updated 3 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…☆11Jul 24, 2024Updated last year
- ☆12Nov 21, 2023Updated 2 years ago
- This is a simple example of how to serve a DeepSeek model with Azure ML.☆10Feb 10, 2025Updated last year
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos☆10Sep 2, 2024Updated last year
- 最大开源中文问答数据集 ,助力中文LLM.The largest open-source Chinese Q&A dataset, supporting Chinese LLM☆10Jul 31, 2023Updated 2 years ago
- Toward Practical Entity Alignment Method Design: Insights from New Highly Heterogeneous Knowledge Graph Datasets☆17Feb 18, 2025Updated last year
- ☆10Feb 16, 2025Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ECG reconstruction☆14Nov 29, 2023Updated 2 years ago