CaptainEven / VideoCaption
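Text summarization (captioning) of videos: given an input video, a deep learning network and AI program identify the main meaning the video expresses (input a video, output text describing the video).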
☆179Updated 6 years ago
Related projects:
- Chinese image captioning☆94Updated 6 years ago
- Chinese image captioning + visual attention☆182Updated 4 years ago
- A dataset of 500,000 multimodal short videos with baseline models (TensorFlow 2.0).☆127Updated 5 years ago
- ☆159Updated this week
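- This repository contains a script to divide a video into key frames.☆158Updated 6 years ago
- A deep learning model for video captioning, implemented in PyTorch with the Transformer architecture. The video captioning task takes a video as input and outputs one sentence describing the content of the whole video (provided the video is short and can be described in a single sentence). The main purpose of this repo is to help the visually impaired…☆78Updated 2 years ago
- Key frame extraction using the color histogram method with OpenCV, and key frame extraction using optical flow analysis written in Python☆46Updated 7 years ago
- Repository for Chinese image captioning☆27Updated 6 years ago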
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆34Updated 4 years ago
- Cross-lingual image captioning☆82Updated 2 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".☆208Updated 4 years ago
- Chinese image captioning implemented with deep learning☆24Updated 5 years ago
- Detects faces in each key frame extracted from a movie☆25Updated 3 years ago
- Image captioning code in Keras, runs on GPU.☆23Updated 4 years ago
- ☆134Updated 5 years ago
- iQIYI multimodal person identification competition, 4th place☆69Updated 5 years ago
- Video synopsis, video enrichment, video condensation, video summarization☆34Updated 4 years ago
- Chinese Visual Question Answering☆47Updated 7 years ago
- TALL: Temporal Activity Localization via Language Query☆184Updated 6 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy☆56Updated 3 years ago
- A reimplementation of Show and Tell☆15Updated 5 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆165Updated 4 years ago
- Code for AAAI2020 paper "Fast Learning of Temporal Action Proposal via Dense Boundary Generator"☆348Updated last year
- PyTorch implementation of video captioning☆400Updated 5 years ago
- ClipCap-based image captioning model☆271Updated 2 years ago
- code for fluency-guided cross-lingual image captioning☆32Updated 6 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆56Updated 3 years ago
- PyTorch C3D feature extractor☆129Updated 6 years ago
- Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks☆178Updated last year
- A PyTorch implementation of "Describing Videos by Exploiting Temporal Structure", ICCV 2015☆47Updated last year