UARK-AICV / VLTinT

[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
64 | Updated 8 months ago

Related projects

Alternatives and complementary repositories for VLTinT