Multi-modal transformer approach for natural language query based joint video summarization and highlight detection
☆17May 23, 2024Updated last year
Alternatives and similar repositories for Visionary-Vids
Users that are interested in Visionary-Vids are comparing it to the libraries listed below
- A PyTorch implementation of the software used in: "A study on the use of attention for explaining video summarization" (NarSUM Workshop a…☆11Oct 20, 2023Updated 2 years ago
- ☆15Aug 4, 2025Updated 7 months ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆86Apr 24, 2023Updated 2 years ago
- ☆19May 19, 2024Updated last year
- Towards Long Form Audio-visual Video Understanding☆15Jan 16, 2026Updated last month
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆151Aug 21, 2024Updated last year
- Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 …☆246Aug 12, 2025Updated 6 months ago
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)☆107Jan 23, 2025Updated last year
- Best API Media Service to make your code better.☆11Feb 26, 2023Updated 3 years ago
- SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)☆14Sep 26, 2025Updated 5 months ago
- Example application for creating an MVC Express + Node + TypeScript app and deploying it to Azure☆10Nov 8, 2018Updated 7 years ago
- ☆40Apr 16, 2024Updated last year
- 📦 A collection of pastable code gathered from past projects☆12Sep 9, 2024Updated last year
- ☆34Jun 2, 2023Updated 2 years ago
- Cross-modal background suppression for audio-visual event localization☆36Mar 18, 2022Updated 3 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Jul 31, 2024Updated last year
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆37Jan 29, 2025Updated last year
- [NeurIPS 2021] Moment-DETR code and QVHighlights dataset☆344Apr 18, 2024Updated last year
- This is the implementation of the paper "Video Summarization by Learning from Unpaired Data" (CVPR 2019)☆37Sep 5, 2019Updated 6 years ago
- Code for the paper "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" (ECCV 2022)☆39Feb 17, 2023Updated 3 years ago
- This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies☆42Oct 1, 2023Updated 2 years ago
- An inventory management system built with PHP and MySQL to help businesses effectively manage their inventory or stock of products.☆16Jun 3, 2024Updated last year
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- ☆11Apr 20, 2023Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last week
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- Edge Impulse FOMO Implementation from scratch☆18Feb 27, 2026Updated last week
- Total copy number inference from single-cell RNA and ATAC sequencing with cell clustering☆11Oct 31, 2024Updated last year
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- ☆13Dec 8, 2022Updated 3 years ago
- The implementation code for the paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 10 months ago
- [ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding☆376May 8, 2024Updated last year
- Basic rover demo from Raspberry Pi with remote teleop over LiveKit☆15Jul 10, 2025Updated 7 months ago
- The implementation of LLMTreeRec☆14Dec 9, 2024Updated last year
- Paytoshi Faucet script☆11May 31, 2016Updated 9 years ago
- Trace a bitmap and project the resulting polygons into OpenSCAD polyhedrons.☆18Mar 14, 2015Updated 10 years ago
- A helper to compare and identify similar keywords using PHP.☆10May 28, 2023Updated 2 years ago
- My personal website☆11Jan 3, 2023Updated 3 years ago