This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It will reveals whether the difference of two results is significant. In this code, we complete evaluation code for Spice details(*i.e.*,Object, Relation, Attribute, Color, Count, and Size ).
☆18Apr 4, 2021Updated 4 years ago
Alternatives and similar repositories for ImageCaptionMetrics
Users that are interested in ImageCaptionMetrics are comparing it to the libraries listed below
Sorting:
- Optimized code based on M2 for faster image captioning training☆21Nov 18, 2022Updated 3 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆123Dec 17, 2022Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Code of SSAN☆69Mar 7, 2024Updated 2 years ago
- Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"☆63Apr 16, 2021Updated 4 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- ☆67Nov 11, 2022Updated 3 years ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆17Oct 17, 2024Updated last year
- an improvement of the paper: Learning to Detect Violent Videos using Convolution LSTM☆11Jun 1, 2020Updated 5 years ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation☆47Jun 29, 2023Updated 2 years ago
- Magic ELF: Image Deraining Meets Association Learning and Transformer☆14Sep 13, 2023Updated 2 years ago
- The implementation for ACL 2022 paper☆20Aug 14, 2022Updated 3 years ago
- Video classification using convGRU☆13Feb 15, 2018Updated 8 years ago
- Image Caption workout with NIC and NBT☆15Apr 5, 2019Updated 6 years ago
- Towards Modality-Agnostic Person Re-identification with Descriptive Query CVPR2023☆29Aug 4, 2024Updated last year
- Code for TGRS 2021 paper. Edge-Aware Multiscale Feature Integration Network for Salient Object Detection in Optical Remote Sensing Images…☆13Apr 6, 2022Updated 3 years ago
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆65Oct 19, 2020Updated 5 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- [CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .☆21Nov 28, 2022Updated 3 years ago
- code scripts for blog posts I published☆12Jun 17, 2020Updated 5 years ago
- code for downloading videos from HowTo100M dataset☆17May 13, 2021Updated 4 years ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- Breast tumor segmentation and shape classification in mammograms using generative adversarial and convolutional neural network☆13Jul 30, 2021Updated 4 years ago
- 计算机相关知识笔记☆10Mar 14, 2026Updated last week
- ImageNet training code of Res2Net☆15Nov 2, 2020Updated 5 years ago
- Code repo for "SketchODE: Learning neural sketch representation in continuous time" published in ICLR 2022☆11Apr 19, 2022Updated 3 years ago
- ☆10Aug 21, 2021Updated 4 years ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆29Aug 4, 2024Updated last year
- 3d face reconstruction, expression. 3D人脸重建出不同面部表情。☆10Aug 29, 2020Updated 5 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- https://www.kaggle.com/c/nbme-score-clinical-patient-notes☆10Sep 1, 2022Updated 3 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- An unofficial implementation of Dual Refinement Underwater Object Detection Network☆11Apr 15, 2022Updated 3 years ago
- 运动车辆检测☆12Jul 28, 2018Updated 7 years ago
- ☆12Mar 14, 2023Updated 3 years ago
- ☆13Jun 26, 2022Updated 3 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Oct 30, 2024Updated last year
- ☆30Nov 15, 2023Updated 2 years ago