EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important temporal segments in educational videos.
☆23 Mar 8, 2024 Updated last year
Alternatives and similar repositories for EDUVSUM
Users who are interested in EDUVSUM are comparing it to the libraries listed below.
- ☆17 Jul 25, 2025 Updated 7 months ago
- ☆14 Sep 20, 2025 Updated 5 months ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops) ☆144 Apr 8, 2023 Updated 2 years ago
- MemRec ☆37 Jan 16, 2026 Updated last month
- [ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset ☆90 Sep 6, 2023 Updated 2 years ago
- ☆10 Mar 23, 2023 Updated 2 years ago
- ☆10 Nov 10, 2021 Updated 4 years ago
- Source code of DisenHAN: Disentangled Heterogeneous Graph Attention Network for Recommendation, CIKM 2020 ☆14 Mar 18, 2023 Updated 2 years ago
- The implementation of LLMTreeRec ☆14 Dec 9, 2024 Updated last year
- Test Code for Super Resolution in MRI ☆11 Sep 17, 2018 Updated 7 years ago
- ☆12 Nov 19, 2024 Updated last year
- Cell2location paper - Comprehensive mapping of tissue cell architecture via integrated single cell and spatial transcriptomics ☆15 Nov 26, 2022 Updated 3 years ago
- A CNN audio classifier via spectrogram images. ☆10 Jul 21, 2017 Updated 8 years ago
- ☆13 Feb 8, 2017 Updated 9 years ago
- A fine multimodality fusion network :) ☆11 Aug 9, 2021 Updated 4 years ago
- This is the official repository for "Can GPTs Evaluate Graphic Design Based on Design Principles?". ☆13 Feb 10, 2025 Updated last year
- This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the p… ☆56 Aug 29, 2024 Updated last year
- Audio recording support for WebGL builds published from Unity ☆12 Mar 5, 2024 Updated 2 years ago
- Distributed INRs to learn an encoding of a large-scale volume for query on a local machine with limited computational resources. ☆11 May 8, 2024 Updated last year
- Multimodal Affective Analysis Using Hierarchical Attention Strategy ☆12 Dec 7, 2018 Updated 7 years ago
- ☆13 Mar 25, 2021 Updated 4 years ago
- A machine learning and computer vision based application to recognize hand gestures and facial tracking, and subsequently display corresp… ☆14 Dec 28, 2020 Updated 5 years ago
- Parse DICOM files in JavaScript (implemented with cornerstone.js) ☆10 Jun 8, 2018 Updated 7 years ago
- PyTorch implementation of Semantics-AssistedVideoCaptioning ☆11 Feb 16, 2023 Updated 3 years ago
- A TensorFlow implementation of Speech Emotion Recognition using Audio signals and Text Data ☆12 May 16, 2022 Updated 3 years ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging ☆11 Dec 2, 2021 Updated 4 years ago
- Instant neural graphics primitives: lightning fast NeRF and more ☆12 Aug 9, 2022 Updated 3 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model ☆11 Jan 3, 2018 Updated 8 years ago
- View volumetric (3D) medical images in Jupyter notebooks ☆15 Oct 19, 2023 Updated 2 years ago
- An Application that can control a timer with just a Look at your hand. Not Kidding...Seriously. ☆10 May 30, 2020 Updated 5 years ago
- Facebook Hatebook Memes Challenge ☆12 Jan 28, 2021 Updated 5 years ago
- Streaming JSON parser designed to process JSON data incrementally. The primary goal is to handle potentially incomplete JSON data streams… ☆12 Apr 5, 2025 Updated 11 months ago
- Added Gradio UI to accompany "A Method for Animating Children's Drawings of the Human Figure" ☆13 May 24, 2023 Updated 2 years ago
- Code for DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue ☆14 Oct 12, 2021 Updated 4 years ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models" ☆15 Oct 12, 2023 Updated 2 years ago
- A Master Thesis Project on Video Keyword Extractor using Video Summarization techniques. ☆11 Oct 25, 2020 Updated 5 years ago
- Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee… ☆11 Oct 22, 2019 Updated 6 years ago
- ☆14 Jan 23, 2025 Updated last year
- TensorFlow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition" ☆10 Dec 19, 2021 Updated 4 years ago