This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It will reveals whether the difference of two results is significant. In this code, we complete evaluation code for Spice details(*i.e.*,Object, Relation, Attribute, Color, Count, and Size ).
☆18Apr 4, 2021Updated 5 years ago
Alternatives and similar repositories for ImageCaptionMetrics
Users that are interested in ImageCaptionMetrics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Optimized code based on M2 for faster image captioning training☆21Nov 18, 2022Updated 3 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆122Dec 17, 2022Updated 3 years ago
- 对常用图像分类和目标检测模型的封装📦,目前已经停更(模型也许已经过时)☆10Mar 27, 2021Updated 5 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- ☆36Nov 3, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code of SSAN☆73Jun 6, 2026Updated 3 weeks ago
- Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"☆63Apr 16, 2021Updated 5 years ago
- Retrieval-augmented Image Captioning☆13Feb 16, 2023Updated 3 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 3 years ago
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- ☆67Nov 11, 2022Updated 3 years ago
- an improvement of the paper: Learning to Detect Violent Videos using Convolution LSTM☆11Jun 1, 2020Updated 6 years ago
- SODA: Story Oriented Dense Video Captioning Evaluation Framework☆14May 3, 2024Updated 2 years ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation☆48Jun 29, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Magic ELF: Image Deraining Meets Association Learning and Transformer☆14Sep 13, 2023Updated 2 years ago
- Video classification using convGRU☆13Feb 15, 2018Updated 8 years ago
- The implementation for ACL 2022 paper☆20Aug 14, 2022Updated 3 years ago
- Image Caption workout with NIC and NBT☆16Apr 5, 2019Updated 7 years ago
- Towards Modality-Agnostic Person Re-identification with Descriptive Query CVPR2023☆31Aug 4, 2024Updated last year
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆65Oct 19, 2020Updated 5 years ago
- Code for TGRS 2021 paper. Edge-Aware Multiscale Feature Integration Network for Salient Object Detection in Optical Remote Sensing Images…☆12Apr 6, 2022Updated 4 years ago
- [CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .☆21Nov 28, 2022Updated 3 years ago
- code for downloading videos from HowTo100M dataset☆18May 13, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆25Aug 5, 2023Updated 2 years ago
- Breast tumor segmentation and shape classification in mammograms using generative adversarial and convolutional neural network☆13Jul 30, 2021Updated 4 years ago
- ☆13Apr 16, 2022Updated 4 years ago
- 3d face reconstruction, expression. 3D人脸重建出不同面部表情。☆10Aug 29, 2020Updated 5 years ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆32Aug 4, 2024Updated last year
- Single Image Deraining via Recurrent Hierarchy Enhancement Network (ACM'MM2019)☆18Dec 28, 2019Updated 6 years ago
- Managed L2D tool libs. (In Dev)☆14Apr 20, 2019Updated 7 years ago
- ☆12Sep 19, 2021Updated 4 years ago
- ImageNet training code of Res2Net☆16Nov 2, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- https://www.kaggle.com/c/nbme-score-clinical-patient-notes☆10Sep 1, 2022Updated 3 years ago
- Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.☆12Mar 2, 2022Updated 4 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- An unofficial implementation: Dual Refinement Underwater Object Detection Network☆11May 11, 2026Updated last month
- Class Incremental learning, Task Incremental Learning☆17Dec 19, 2022Updated 3 years ago
- ☆13Jun 26, 2022Updated 4 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Oct 30, 2024Updated last year