EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important temporal segments in educational videos.
☆23Mar 8, 2024Updated 2 years ago
Alternatives and similar repositories for EDUVSUM
Users that are interested in EDUVSUM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)☆144Apr 8, 2023Updated 3 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 5 years ago
- ☆17Aug 6, 2021Updated 4 years ago
- ☆12Aug 7, 2024Updated last year
- ☆17Jul 25, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The implement of LLMTreeRec☆14Dec 9, 2024Updated last year
- ☆14Sep 20, 2025Updated 8 months ago
- [ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos☆127Sep 29, 2023Updated 2 years ago
- Code for GHA (ACCV2018)☆13Oct 31, 2018Updated 7 years ago
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Jun 7, 2023Updated 2 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆57Jan 14, 2022Updated 4 years ago
- PyTorch implementation of the models described in the IEEE ICASSP 2022 paper "Is cross-attention preferable to self-attention for multi-m…☆65Mar 29, 2025Updated last year
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- 3D household task-based dataset created using customised AI2-THOR.☆14Apr 14, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2023] Pytorch Code of MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering☆17Jul 11, 2023Updated 2 years ago
- Code supporting the ISMIR 2020 Klio Tutorial☆20Oct 11, 2020Updated 5 years ago
- Toolbox for IBP Coupled SPCM-CRP Hidden Markov Model. Also contains code for EM-based HMM learning and inference for Bayesian non-paramet…☆14Mar 21, 2019Updated 7 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 6 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- VIA modification for sign language annotation☆18Apr 30, 2021Updated 5 years ago
- ☆11May 18, 2022Updated 4 years ago
- Source codes and datasets for the paper "Incorporating Anticipation Embedding into Reinforcement Learning Framework for Multi-hop Knowled…☆12Aug 20, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 5 years ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago
- IJCAI 2022 MLP4Rec☆17Sep 5, 2022Updated 3 years ago
- [ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning☆28Jun 28, 2023Updated 2 years ago
- This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the p…☆57Aug 29, 2024Updated last year
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".☆57Oct 22, 2023Updated 2 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Jan 3, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for the ACL 2021 Findings paper "Yichi Zhang and Joyce Chai. Hierarchical Task Learning from Language Instructions with Uni…☆24Jun 28, 2021Updated 4 years ago
- 知识文档问答,用大模型与文档对话,提供Al分析、阅读、问答工具,助你快速了解文档内容。☆21Sep 4, 2024Updated last year
- Generalized Product Quantization Network For Semi-supervised Image Retrieval - CVPR 2020☆63May 27, 2024Updated last year
- 本项目是基于讯飞星火的智能数据分析平台☆25Aug 28, 2024Updated last year
- DeerSheep0314 / Re4-Learning-to-Re-contrast-Re-attend-Re-construct-for-Multi-interest-Recommendation☆23Aug 4, 2022Updated 3 years ago
- ☆16Oct 11, 2022Updated 3 years ago
- An attempt at genre classification with convolutional neural networks and spectrograms☆15Nov 25, 2017Updated 8 years ago