Skyline-9/Visionary-Vids

Multi-modal transformer approach for natural language query based joint video summarization and highlight detection

☆17

Alternatives and similar repositories for Visionary-Vids

Users who are interested in Visionary-Vids are comparing it to the repositories listed below.

e-apostolidis / XAI-SUM
A PyTorch implementation of the software used in: "A study on the use of attention for explaining video summarization" (NarSUM Workshop a…
☆11 · Oct 20, 2023 · Updated 2 years ago
boheumd / A2Summ
The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)
☆86 · Apr 24, 2023 · Updated 3 years ago
fjchange / Awesome_Video_Summarization
Collection of papers and code for video summarization / video highlight detection / video key frame selection
☆37 · Jul 16, 2021 · Updated 5 years ago
BriansIDP / AudioVisualLLM
☆19 · May 19, 2024 · Updated 2 years ago
StevRamos / video_summarization
A computing solution based on deep learning that allows the efficient generation of keyshot type spotlights from videos.
☆19 · Jan 13, 2022 · Updated 4 years ago
wjun0830 / QD-DETR
Official PyTorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 …
☆251 · Aug 12, 2025 · Updated 11 months ago
GeWu-Lab / LFAV
Towards Long Form Audio-visual Video Understanding
☆15 · Jan 16, 2026 · Updated 6 months ago
j-min / HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
☆110 · Jan 23, 2025 · Updated last year
intel / TVP
☆15 · Aug 4, 2025 · Updated 11 months ago
e-apostolidis / CA-SUM
A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of …
☆31 · Jun 29, 2022 · Updated 4 years ago
wjun0830 / CGDETR
Official PyTorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…
☆154 · Aug 21, 2024 · Updated last year
MGitHubL / TMac
☆14 · Feb 26, 2024 · Updated 2 years ago
medhini / Instructional-Video-Summarization
Code for the paper "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" (ECCV 2022)
☆39 · Feb 17, 2023 · Updated 3 years ago
pcshih / pytorch-VSLUD
An implementation of the paper "Video Summarization by Learning from Unpaired Data" (CVPR 2019)
☆37 · Sep 5, 2019 · Updated 6 years ago
zyWang-Power / TUDA
☆10 · Apr 20, 2023 · Updated 3 years ago
wlzhang2020 / LLMTreeRec
The implementation of LLMTreeRec
☆14 · Dec 9, 2024 · Updated last year
Jielin-Qiu / MMSum_model
[CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
☆38 · Jan 29, 2025 · Updated last year
IntelLabs / GraVi-T
Graph learning framework for long-term video understanding
☆72 · Jul 13, 2026 · Updated 2 weeks ago
caravagnalab / rcongas
Total copy number inference from single-cell RNA and ATAC sequencing with cell clustering
☆12 · Oct 31, 2024 · Updated last year
ChrisAllenMing / Cross_Category_Video_Highlight
Implementation of Cross-category Video Highlight Detection via Set-based Learning (ICCV 2021).
☆81 · Aug 27, 2021 · Updated 4 years ago
josephdviviano / whatsinthebox
Analysis of public NLP corpora
☆11 · Feb 9, 2023 · Updated 3 years ago
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 · Mar 14, 2025 · Updated last year
ewwink / wikipedia-wordlists-extractor
Extract Unique Word Lists From Wikipedia Database
☆13 · May 27, 2020 · Updated 6 years ago
guotaowang / STANet
☆16 · Sep 20, 2022 · Updated 3 years ago
XL2248 / SOV-MAS
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11 · May 16, 2023 · Updated 3 years ago
btelle / podcasts-dataset
Dataset of podcasts and episodes
☆14 · Jan 16, 2018 · Updated 8 years ago
poryfly / AudioFeExtr
Audio feature extraction program: MFCC, HFCC, MFCC_WALSH, Philips
☆31 · Mar 31, 2019 · Updated 7 years ago
TIBHannover / MSVA
Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)
☆47 · Mar 21, 2024 · Updated 2 years ago
fordevoted / UIESS
Official code for the paper "Domain Adaptation for Underwater Image Enhancement via Content and Style Separation" (IEEE Access 2022)
☆11 · Nov 7, 2022 · Updated 3 years ago
MengboLi / MS-SENet
☆11 · Jul 16, 2024 · Updated 2 years ago
km1994 / nlp_paper_study_search_engine
This repository mainly records study notes on search engines for NLP algorithm engineers
☆14 · Apr 9, 2022 · Updated 4 years ago
shivamjain1 / Car-Racer-Vanilla-JS
Car Racer Game in Vanilla JavaScript
☆14 · Mar 31, 2023 · Updated 3 years ago
saakur / EventSegmentation
Code for CVPR 2019 paper
☆12 · Apr 26, 2019 · Updated 7 years ago
mengshiY / RCSF
Code for the paper "Cross-Domain Slot Filling as Machine Reading Comprehension" (IJCAI 2021)
☆11 · Aug 24, 2021 · Updated 4 years ago
e-apostolidis / PoR-Summarization-Measure
A Python implementation for computing the PoR metric for video summarization from "Performance over Random: A Robust Evaluation Protocol …
☆10 · May 4, 2022 · Updated 4 years ago
Serega6678 / NuNER
NuNER is the family of SOTA Foundation and Zero-shot for Entity Recognition
☆15 · Jun 11, 2024 · Updated 2 years ago
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago
yangjingyuan / ConstDecoder
☆11 · Oct 24, 2022 · Updated 3 years ago
wuzhe71 / STGN
☆12 · Nov 4, 2022 · Updated 3 years ago