cap-ntu / Video-to-Retail-Platform
An intelligent multimodal-learning based system for video, product and ads analysis. Based on the system, people can build a lot of downstream applications such as product recommendation, video retrieval, etc.
☆139Updated 3 years ago
Related projects: ⓘ
- Video Summarization using Statistics and Machine Learning☆31Updated 6 years ago
- Papers, code and datasets about deep learning and multi-modal learning for video analysis☆747Updated 2 years ago
- Semantically be able to search through a database of videos (using generated summaries)☆64Updated 5 years ago
- Code and demos for our paper at ACM MM 2017☆63Updated 5 years ago
- Video Summarization Dataset, Papers, Codes☆154Updated 6 years ago
- video-understanding:Video Classification, Action Recognition, Video Datasets☆141Updated 6 years ago
- Learning to align and match videos with kernelized temporal layers☆137Updated 2 years ago
- Experimenting with different Summarizing techniques on SumMe Dataset☆132Updated 4 years ago
- 7th place solution to The 3rd YouTube-8M Video Understanding Challenge☆37Updated 3 years ago
- Generating video descriptions using deep learning in Keras☆25Updated 4 years ago
- PyTorch Implementation of "Facial Image-to-Video Translation by a Hidden Affine Transformation" in MM'19.☆55Updated 4 years ago
- A collection of recent video understanding datasets, under construction!☆454Updated 6 years ago
- Detecting near-duplicate videos by aggregating features from intermediate CNN layers☆95Updated 6 years ago
- MassFace: an effecient implementation using triplet loss for face recognition☆56Updated 4 years ago
- 1st place solution to Kaggle's 2018 YouTube-8M Video Understanding Challenge☆199Updated last year
- Baseline Code for ADHA dataset☆65Updated 5 years ago
- Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation☆220Updated 4 months ago
- Four-in-one deep network: image search, image captioning, similar words and similar images using a single model☆136Updated 5 years ago
- FIVR-200K dataset from the "FIVR: Fine-grained Incident Video Retrieval" [TMM 2019]☆78Updated last year
- UCloud AI SDK☆33Updated last year
- ☆14Updated this week
- ☆24Updated 6 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆165Updated 4 years ago
- The iMaterialist Fashion Attribute Dataset☆82Updated 3 years ago
- Unofficial PyTorch Implementation of SUM-GAN from "Unsupervised Video Summarization with Adversarial LSTM Networks" (CVPR 2017)☆239Updated last year
- Code of PhoenixLin(3rd place) in the 2nd Youtube8M Video Understanding Challenge☆206Updated 5 years ago
- A PyTorch implementation of VSumPtrGAN☆40Updated 9 months ago
- ML☆108Updated 7 years ago
- This repo is used for generating faking labeled positive videos for SVD dataset.☆10Updated 4 years ago
- Feature extraction from videos based on intermediate layers of a Convolutional Neural Network.☆63Updated last year