[ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
☆28Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for VLCAP
Users that are interested in VLCAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning☆68Feb 16, 2024Updated 2 years ago
- [Lab] lab website☆11Mar 18, 2026Updated last week
- [IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation☆20Jul 2, 2024Updated last year
- [ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation☆147Aug 19, 2024Updated last year
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"☆14Dec 31, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Video Feature Extractor for S3D-HowTo100M☆29Apr 30, 2021Updated 4 years ago
- ☆16Dec 4, 2025Updated 3 months ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- Theano-based implementation of the efficient sparse-coding algorithms by Honglak Lee et al. (2006)☆12Jan 4, 2016Updated 10 years ago
- Generative Models for Image Captioning☆10Jun 7, 2017Updated 8 years ago
- converting the pretrained tensorflow SoundNet model to pytorch☆14Jun 15, 2022Updated 3 years ago
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL …☆11May 22, 2023Updated 2 years ago
- ☆26Dec 20, 2024Updated last year
- Adapted from the widely used project webpage template made by the colorful folks.☆42Aug 8, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code for GHA (ACCV2018)☆13Oct 31, 2018Updated 7 years ago
- Re-thinking Co-Salient Object Detection, TPAMI 2021☆24Jan 26, 2023Updated 3 years ago
- Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector☆11Jun 24, 2023Updated 2 years ago
- ☆10Sep 26, 2023Updated 2 years ago
- ☆10Oct 7, 2023Updated 2 years ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆20May 7, 2022Updated 3 years ago
- PyTorch implementation of soft-nms☆31Nov 24, 2025Updated 4 months ago
- Code for "Time-Aware Auto White Balance in Mobile Photography"☆28Jan 25, 2026Updated 2 months ago
- ☆15Nov 19, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important tempo…☆23Mar 8, 2024Updated 2 years ago
- Code and data for experiments on semantic fragments☆11Jun 23, 2022Updated 3 years ago
- pip install poai☆14Mar 2, 2026Updated 3 weeks ago
- NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU☆11Jun 22, 2023Updated 2 years ago
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- ☆11Jun 27, 2023Updated 2 years ago
- ☆18Dec 8, 2024Updated last year
- ☆11Sep 15, 2023Updated 2 years ago
- [CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.☆50Sep 30, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆12Sep 17, 2022Updated 3 years ago
- ☆15Dec 7, 2022Updated 3 years ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆77Sep 19, 2025Updated 6 months ago
- Implementation of a neural network MLP in C++.☆10Dec 17, 2018Updated 7 years ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Sep 19, 2024Updated last year
- Code for our IJCAI 2019 paper entitled "Conditional GAN with Discriminative Filter Generation for Text-to-Video Synthesis"☆14Mar 29, 2022Updated 3 years ago
- ☆11Dec 8, 2022Updated 3 years ago