UARK-AICV/VLCAP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UARK-AICV/VLCAP)

UARK-AICV / VLCAP

[ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

☆28

Alternatives and similar repositories for VLCAP

Users that are interested in VLCAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UARK-AICV / TrackGUI
View on GitHub
☆23Nov 11, 2024Updated last year
UARK-AICV / VLTinT
View on GitHub
[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
☆68Feb 16, 2024Updated 2 years ago
UARK-AICV / UARK-AICV.github.io
View on GitHub
[Lab] lab website
☆12May 29, 2026Updated last month
UARK-AICV / AOE-Net
View on GitHub
[IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
☆20Jul 2, 2024Updated 2 years ago
UARK-AICV / OpenFusion
View on GitHub
[ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
☆154Aug 19, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
UARK-AICV / TSRNet
View on GitHub
[ISBI 2024] An implementation of TSRNet for ECG Anomaly Detection
☆24Apr 11, 2024Updated 2 years ago
UARK-AICV / 3DConvCaps
View on GitHub
[ICPR 2022] 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
☆48Jun 26, 2022Updated 4 years ago
UARK-AICV / AerialFormer
View on GitHub
[Remote Sensing] AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation
☆70Apr 23, 2024Updated 2 years ago
UARK-AICV / ECG_SSL_12Lead
View on GitHub
[IEEE BHI 2022] Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning
☆44Oct 4, 2024Updated last year
UARK-AICV / AISFormer
View on GitHub
[BMVC 2022] AISFormer: Amodal Instance Segmentation with Transformer
☆46Nov 24, 2024Updated last year
vhvkhoa / TAPG-AgentEnvInteration
View on GitHub
☆10Nov 10, 2022Updated 3 years ago
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated 11 months ago
lxa9867 / PaintSeg
View on GitHub
[NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"
☆14Dec 31, 2023Updated 2 years ago
UARK-AICV / SAM3D
View on GitHub
[ISBI 2024] An implementation of SAM3D which adapts Segment Anything Model for Volumetric Medical Image Segmentation
☆85May 28, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ArrowLuo / VideoFeatureExtractor
View on GitHub
Video Feature Extractor for S3D-HowTo100M
☆29Apr 30, 2021Updated 5 years ago
mlvlab / MELTR
View on GitHub
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
☆35Apr 23, 2024Updated 2 years ago
hiteshK03 / Remote-sensing-image-captioning-with-transformer-and-multilabel-classification
View on GitHub
☆18Nov 23, 2022Updated 3 years ago
dagingehelgoy / Master
View on GitHub
Generative Models for Image Captioning
☆10Jun 7, 2017Updated 9 years ago
MarkPotanin / copy_paste_aug_detectron2
View on GitHub
Copy-paste augmentation in detectron2 pipeline
☆35Mar 25, 2021Updated 5 years ago
elliottwu / webpage-template
View on GitHub
Adapted from the widely used project webpage template made by the colorful folks.
☆42Aug 8, 2021Updated 4 years ago
craft-hand / CRAFT-Hand_API
View on GitHub
☆15Jun 4, 2026Updated last month
DengPingFan / CoEGNet
View on GitHub
Re-thinking Co-Salient Object Detection, TPAMI 2021
☆24Jan 26, 2023Updated 3 years ago
soham97 / ADIFF
View on GitHub
Explaining audio differences using language
☆16Feb 11, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
zhangyuygss / SVFSal
View on GitHub
Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector
☆11Jun 24, 2023Updated 3 years ago
tianyuan168326 / EAN-Pytorch
View on GitHub
Official PyTorch Implementation of paper EAN: Event Adaptive Network for Efficient Action Recognition https://arxiv.org/abs/2107.10771
☆33Oct 24, 2023Updated 2 years ago
PanXiebit / PiSLRTc
View on GitHub
[TMM 2021] PiSLTRc: Position-informed Sign Language Transformer with Content-aware Convolution
☆11Dec 9, 2021Updated 4 years ago
yongcaoplus / TIN-SLT
View on GitHub
Code for Paper "Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation"
☆12Feb 6, 2023Updated 3 years ago
minrq / CGAN_Text2Video
View on GitHub
Code for our IJCAI 2019 paper entitled "Conditional GAN with Discriminative Filter Generation for Text-to-Video Synthesis"
☆14Mar 29, 2022Updated 4 years ago
GauravGajbhiye / SCAMET_RSIC
View on GitHub
This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.
☆13Aug 10, 2023Updated 2 years ago
XuMengyaAmy / ReportDALS
View on GitHub
☆16Nov 19, 2020Updated 5 years ago
m-decoster / fpt4slt
View on GitHub
Frozen Pretrained Transformers for Neural Sign Language Translation
☆15Apr 23, 2022Updated 4 years ago
VideoAnalysis / EDUVSUM
View on GitHub
EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important tempo…
☆23Mar 8, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
weiminye / Hands-On-Artificial-Intelligence-for-Banking-Chinese
View on GitHub
《金融中的人工智能》配套代码
☆11Sep 20, 2022Updated 3 years ago
KoDohwan / VT-TWINS
View on GitHub
Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)
☆11Oct 12, 2022Updated 3 years ago
rovle / gpt3-in-context-fitting
View on GitHub
Experiments on GPT-3's ability to fit numerical models in-context.
☆14Aug 11, 2022Updated 3 years ago
yizhou42 / MfH
View on GitHub
☆19May 15, 2026Updated 2 months ago
QiQAng / UEDVC
View on GitHub
☆12May 26, 2023Updated 3 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
MarcusNerva / HMN
View on GitHub
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
☆50Sep 30, 2022Updated 3 years ago