UARK-AICV / VLTinT

[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
66Updated last year

Alternatives and similar repositories for VLTinT:

Users that are interested in VLTinT are comparing it to the libraries listed below