UARK-AICV / VLTinT

[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
66Updated 11 months ago

Alternatives and similar repositories for VLTinT:

Users that are interested in VLTinT are comparing it to the libraries listed below