[ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
☆28Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for VLCAP
Users that are interested in VLCAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning☆68Feb 16, 2024Updated 2 years ago
- [Lab] lab website☆11Mar 23, 2026Updated last month
- [IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation☆20Jul 2, 2024Updated last year
- [ICPR 2022] 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation☆47Jun 26, 2022Updated 3 years ago
- ☆10Nov 10, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [BMVC 2022] AISFormer: Amodal Instance Segmentation with Transformer☆46Nov 24, 2024Updated last year
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 9 months ago
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"☆14Dec 31, 2023Updated 2 years ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆75Aug 24, 2024Updated last year
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 7 months ago
- ☆18Dec 4, 2025Updated 5 months ago
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)☆35Apr 23, 2024Updated 2 years ago
- Explaining audio differences using language☆16Feb 11, 2025Updated last year
- ☆18Nov 23, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- ☆13Jan 8, 2020Updated 6 years ago
- Generative Models for Image Captioning☆10Jun 7, 2017Updated 8 years ago
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL …☆11May 22, 2023Updated 2 years ago
- Examples of Verbalized Machine Learning (VML)☆16Mar 16, 2025Updated last year
- Adapted from the widely used project webpage template made by the colorful folks.☆42Aug 8, 2021Updated 4 years ago
- Code for GHA (ACCV2018)☆13Oct 31, 2018Updated 7 years ago
- ☆10Oct 7, 2023Updated 2 years ago
- ☆13Jun 26, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- ☆15Nov 19, 2020Updated 5 years ago
- code for composite in situ imaging (cisi) analysis☆12Oct 26, 2020Updated 5 years ago
- Frozen Pretrained Transformers for Neural Sign Language Translation☆15Apr 23, 2022Updated 4 years ago
- Code and data for experiments on semantic fragments☆11Jun 23, 2022Updated 3 years ago
- NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU☆11Jun 22, 2023Updated 2 years ago
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆171Dec 4, 2020Updated 5 years ago
- ☆19Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [AAAI 2025] Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving☆53Mar 4, 2026Updated 2 months ago
- ☆12May 26, 2023Updated 2 years ago
- AAAI 2018 (Spotlight)☆16Sep 7, 2024Updated last year
- rendezvous-in-time☆13Sep 17, 2025Updated 7 months ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆13Sep 17, 2022Updated 3 years ago
- Code for our IJCAI 2019 paper entitled "Conditional GAN with Discriminative Filter Generation for Text-to-Video Synthesis"☆14Mar 29, 2022Updated 4 years ago
- A PyTorch implementation of VIOLET☆138Dec 17, 2023Updated 2 years ago