Video Captioning is an encoder-decoder model based on sequence-to-sequence learning
☆139Apr 9, 2024Updated 2 years ago
Alternatives and similar repositories for Video-Captioning
Users who are interested in Video-Captioning are comparing it to the libraries listed below.
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)☆230Jan 3, 2024Updated 2 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 5 years ago
- pytorch implementation of video captioning☆401Aug 19, 2019Updated 6 years ago
- The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".☆57Oct 22, 2023Updated 2 years ago
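- Text summarization (captioning) of videos: given an input video, a deep learning network and AI program identify the main meaning the video conveys (input a video, output a text describing the video).☆189Mar 20, 2018Updated 8 years ago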
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆151Jul 8, 2019Updated 6 years ago
- [CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.☆50Sep 30, 2022Updated 3 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy☆55Jul 31, 2021Updated 4 years ago
- Video Grounding and Captioning☆332Oct 12, 2021Updated 4 years ago
- Original Full Repository of the Paper: "Domain-Adaptive Self-Supervised Pre-training for Face & Body Detection in Drawings"☆20Oct 14, 2025Updated 8 months ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Nov 22, 2022Updated 3 years ago
- Deploy Swin Transformer using TorchServe☆29Jul 13, 2021Updated 4 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 6 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- Source codes of our paper in TCSVT 2025: PLOVAD: Prompting Vision-Language Models for Open Vocabulary Video Anomaly Detection☆31Feb 15, 2025Updated last year
- Video content description model for generating descriptions for unconstrained videos☆15Jul 5, 2019Updated 6 years ago
- [ECCV 2024] LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation☆15Dec 23, 2024Updated last year
- This repository focuses on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP☆410Nov 14, 2022Updated 3 years ago
- The extended and verified music video emotion analysis dataset for data driven algorithm.☆18Aug 9, 2021Updated 4 years ago
- video captioning using 3DCNN and LSTM (pytorch)☆11Sep 26, 2019Updated 6 years ago
- [NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer☆33Oct 2, 2025Updated 8 months ago
- ☆15Jul 8, 2024Updated last year
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021☆66Oct 21, 2021Updated 4 years ago
- Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).☆13Mar 25, 2022Updated 4 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- DOM2AFrame library to render HTML/CSS in WebVR☆24Dec 7, 2017Updated 8 years ago
- ☆192Jun 16, 2025Updated 11 months ago
- ☆29Jul 18, 2025Updated 10 months ago
- ☆26Oct 20, 2021Updated 4 years ago
- A Memory Network Approach for Story-based Temporal Summarization of 360° Videos☆12May 8, 2020Updated 6 years ago
- ☆10Aug 19, 2024Updated last year
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Jun 28, 2021Updated 4 years ago
- GNURadio block to play back files with delays between replays and/or limited replay counts.☆15Sep 16, 2023Updated 2 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- Solution to the Soccernet 2024 Dense video captioning task from CVPR workshop☆12Jul 1, 2024Updated last year
- Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)☆75Aug 25, 2021Updated 4 years ago
- ImageNet3D: Towards General-Purpose Object-Level 3D Understanding☆21Dec 6, 2024Updated last year