aimagelab/mvad-names-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aimagelab/mvad-names-dataset)

aimagelab / mvad-names-dataset

M-VAD Names Dataset. Multimedia Tools and Applications (2019)

☆24

Alternatives and similar repositories for mvad-names-dataset

Users that are interested in mvad-names-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yj-yu / CiSIN
View on GitHub
Character Grounding and Re-Identification in Story of Videos and Text Descriptions
☆10Jan 17, 2021Updated 5 years ago
shengyuzhang / Poet
View on GitHub
Poet: Product-oriented Video Captioner for E-commerce
☆12Sep 21, 2020Updated 5 years ago
jamespark3922 / lsmdc-fillin
View on GitHub
Identity-Aware Multi-Sentence Video Description
☆15Jun 12, 2023Updated 3 years ago
xiadingZ / video-caption-openNMT.pytorch
View on GitHub
implement video caption based on openNMT
☆36Apr 19, 2018Updated 8 years ago
WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
View on GitHub
Source code for Delving Deeper into the Decoder for Video Captioning
☆39Jun 1, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sharpenb / Uncertainty-Event-Prediction
View on GitHub
Uncertainty on Asynchronous Time Event Prediction (Spotlight, Neurips 2019)
☆20Oct 8, 2020Updated 5 years ago
chaoyuaw / lvu
View on GitHub
☆87Mar 4, 2024Updated 2 years ago
bharath272 / semantic_contours
View on GitHub
Code for the ICCV 2011 paper"Semantic contours from inverse detectors"
☆12May 15, 2012Updated 14 years ago
ycxioooong / MovieSynopsisAssociation
View on GitHub
Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019
☆52Aug 9, 2020Updated 5 years ago
b05902062 / TDConvED
View on GitHub
implementation of TDConvED for video captioning
☆13Mar 18, 2020Updated 6 years ago
aleju / CharVectorizer
View on GitHub
Transform strings to vectors for neural networks.
☆15May 22, 2015Updated 11 years ago
ozansener / RecipeWatch
View on GitHub
☆12Jan 12, 2016Updated 10 years ago
Abdelrhman-Yasser / video-content-description
View on GitHub
Video content description model for generating descriptions for unconstrained videos
☆15Jul 5, 2019Updated 7 years ago
lvapeab / interactive-keras-captioning
View on GitHub
Interactive multimedia captioning with Keras
☆16Aug 2, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zinuoli / TriSense
View on GitHub
[NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM
☆27Feb 10, 2026Updated 5 months ago
WingsBrokenAngel / Semantics-AssistedVideoCaptioning
View on GitHub
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
☆55Jul 31, 2021Updated 4 years ago
jssprz / video_features_extractor
View on GitHub
Python implementation of extraction of several visual features representations from videos
☆23Jul 19, 2021Updated 4 years ago
amazon-science / crossmodal-contrastive-learning
View on GitHub
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
☆62Feb 7, 2022Updated 4 years ago
vivoutlaw / SSIAM
View on GitHub
Self-supervised Siamese network (SSiam), FG 2019
☆27Apr 21, 2023Updated 3 years ago
hobincar / reconstruction-network-for-video-captioning
View on GitHub
☆20Sep 19, 2019Updated 6 years ago
zhenyangli / online_action
View on GitHub
☆14Sep 19, 2016Updated 9 years ago
MILVLG / mt-captioning
View on GitHub
A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning
☆25Sep 4, 2020Updated 5 years ago
Geo-Tell / DS-PMNet
View on GitHub
☆11Dec 11, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
hertasecurity / LTFT
View on GitHub
This repository contains the video files (download links) and corresponding annotations used in the paper "Long-Term Face Tracking for Cr…
☆14Dec 18, 2020Updated 5 years ago
Sundrops / video-caption.pytorch
View on GitHub
☆33Apr 20, 2018Updated 8 years ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
FrenchKrab / datasets-pyannote
View on GitHub
Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)
☆15Oct 22, 2025Updated 8 months ago
showlab / FocusUI
View on GitHub
[CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
☆35Jun 7, 2026Updated last month
cvlab-epfl / social-scene-understanding
View on GitHub
Source code for the CVPR 2017 paper
☆64Apr 23, 2018Updated 8 years ago
bytedance / F-16
View on GitHub
F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…
☆39Jul 3, 2025Updated last year
cvlab-columbia / oops
View on GitHub
Code for Oops! Predicting Unintentional Action in Video
☆80Apr 13, 2020Updated 6 years ago
GingL / ARN
View on GitHub
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding
☆33Aug 29, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
eric-xw / kinetics-i3d-pytorch
View on GitHub
☆35Mar 22, 2019Updated 7 years ago
pwdonh / audio_tokens
View on GitHub
This is a Javascript toolbox to perform online rating studies with auditory material.
☆18Nov 18, 2024Updated last year
jalayrac / object-states-action
View on GitHub
Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017
☆14Aug 7, 2018Updated 7 years ago
Mengmi / deepfuturegaze_gan
View on GitHub
Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks
☆33Mar 12, 2020Updated 6 years ago
sysulic / FL-MSRE
View on GitHub
A Few-Shot Learning based Approach to Multimodal Social Relation Extraction
☆14Jan 17, 2023Updated 3 years ago
jssprz / visual_syntactic_embedding_video_captioning
View on GitHub
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
☆30Apr 16, 2021Updated 5 years ago
MarvinLvn / BabySLM
View on GitHub
Behavioral probing of language acquisition models at the lexical and syntactic level
☆20Jul 17, 2023Updated 2 years ago