Optimized code based on M2 for faster image captioning training
☆21Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for Transformer-Captioning
Users that are interested in Transformer-Captioning are comparing it to the libraries listed below
Sorting:
- Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).☆202Jun 8, 2022Updated 3 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆123Dec 17, 2022Updated 3 years ago
- [CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .☆21Nov 28, 2022Updated 3 years ago
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Image Caption workout with NIC and NBT☆15Apr 5, 2019Updated 6 years ago
- This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It wil…☆18Apr 4, 2021Updated 4 years ago
- Lightweight Transformer for Multi-modal Tasks☆16Dec 9, 2022Updated 3 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Jun 6, 2022Updated 3 years ago
- ☆22Jun 30, 2023Updated 2 years ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- ☆24Apr 4, 2022Updated 3 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- ☆61Oct 23, 2021Updated 4 years ago
- ☆29Oct 19, 2022Updated 3 years ago
- Meshed-Memory Transformer for Image Captioning. CVPR 2020☆545Dec 21, 2022Updated 3 years ago
- Towards Local Visual Modeling for Image Captioning☆29Mar 31, 2023Updated 2 years ago
- Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …☆61Oct 21, 2022Updated 3 years ago
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆65Oct 19, 2020Updated 5 years ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆198May 9, 2023Updated 2 years ago
- [ICML2024]The official implementation of SemiRES in PyTorch.☆33Jun 20, 2024Updated last year
- Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)☆75Aug 25, 2021Updated 4 years ago
- ☆85Dec 4, 2022Updated 3 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- Data generation code for Ditto☆12Apr 28, 2022Updated 3 years ago
- [NeurIPS'25 Spotlight] This is the official codebase for the paper: STAR: A Benchmark for Astronomical Star Fields Super-Resolution☆15Oct 9, 2025Updated 4 months ago
- ☆43Jun 1, 2023Updated 2 years ago
- NightSurveillance Sataset for Pedestrian Detection☆11Jul 30, 2020Updated 5 years ago
- Training a BERT model from scratch.☆11Oct 15, 2023Updated 2 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Jan 30, 2021Updated 5 years ago
- Code for "Learning Harmonic Molecular Representations on Riemannian Manifold", ICLR, 2023☆10Mar 23, 2023Updated 2 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Dec 29, 2019Updated 6 years ago
- ☆14Jan 5, 2024Updated 2 years ago
- Some commonly used functions and modules☆10Jan 15, 2024Updated 2 years ago
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten…☆12Jan 30, 2021Updated 5 years ago
- 一个签到☆10Mar 14, 2024Updated last year
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- ☆10Jan 20, 2021Updated 5 years ago