xmu-xiaoma666/ImageCaptionMetrics

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xmu-xiaoma666/ImageCaptionMetrics)

xmu-xiaoma666 / ImageCaptionMetrics

This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It will reveals whether the difference of two results is significant. In this code, we complete evaluation code for Spice details(*i.e.*,Object, Relation, Attribute, Color, Count, and Size ).

☆18

Alternatives and similar repositories for ImageCaptionMetrics

Users that are interested in ImageCaptionMetrics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

luo3300612 / Transformer-Captioning
View on GitHub
Optimized code based on M2 for faster image captioning training
☆21Nov 18, 2022Updated 3 years ago
zhangxuying1004 / RSTNet
View on GitHub
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
☆123Dec 17, 2022Updated 3 years ago
zifyloo / SSAN
View on GitHub
Code of SSAN
☆74Jun 6, 2026Updated last month
ZhiyinShao-H / LGUR
View on GitHub
☆36Nov 3, 2022Updated 3 years ago
TencentYoutuResearch / PersonReID-NAFS
View on GitHub
Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"
☆63Apr 16, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RitaRamo / extra
View on GitHub
Retrieval-augmented Image Captioning
☆13Feb 16, 2023Updated 3 years ago
Adit31 / Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning
View on GitHub
Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
☆13Jun 26, 2023Updated 3 years ago
LeeYN-43 / Clover
View on GitHub
Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)
☆40Feb 15, 2023Updated 3 years ago
w5688414 / EfficientNet-ViolenceDetection
View on GitHub
an improvement of the paper: Learning to Detect Violent Videos using Convolution LSTM
☆11Jun 1, 2020Updated 6 years ago
ruotianluo / coco-caption
View on GitHub
☆67Nov 11, 2022Updated 3 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
View on GitHub
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18Sep 17, 2021Updated 4 years ago
yuppielqx / Time-FFM
View on GitHub
☆16May 10, 2024Updated 2 years ago
fujiso / SODA
View on GitHub
SODA: Story Oriented Dense Video Captioning Evaluation Framework
☆14May 3, 2024Updated 2 years ago
Mi-Peng / Sparse-Sharpness-Aware-Minimization
View on GitHub
[NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation
☆48Jun 29, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ShiqiaoZhou / BALM-TSF
View on GitHub
Official implement of CIKM '25 paper: BALM-TSF
☆17Nov 25, 2025Updated 7 months ago
kuijiang94 / Magic-ELF
View on GitHub
Magic ELF: Image Deraining Meets Association Learning and Transformer
☆14Sep 13, 2023Updated 2 years ago
filick / GRU-RCN
View on GitHub
Video classification using convGRU
☆13Feb 15, 2018Updated 8 years ago
ccq195 / UNIReID
View on GitHub
Towards Modality-Agnostic Person Re-identification with Descriptive Query CVPR2023
☆31Aug 4, 2024Updated last year
fenglinliu98 / MIA
View on GitHub
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" （NeurIPS 2019）
☆65Oct 19, 2020Updated 5 years ago
BorealisAI / PETSA
View on GitHub
[PUT ICML 2025] Accurate Parameter-Efficient Test-Time Adaptation for Time Series Forecasting
☆19May 30, 2025Updated last year
Kunye-Shen / EMFINet
View on GitHub
Code for TGRS 2021 paper. Edge-Aware Multiscale Feature Integration Network for Salient Object Detection in Optical Remote Sensing Images…
☆12Apr 6, 2022Updated 4 years ago
LandyGuo / Download_HowTo100M
View on GitHub
code for downloading videos from HowTo100M dataset
☆18May 13, 2021Updated 5 years ago
bomri / code-for-posts
View on GitHub
code scripts for blog posts I published
☆12Jun 17, 2020Updated 6 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mrwu-mac / DIFNet
View on GitHub
[CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .
☆21Nov 28, 2022Updated 3 years ago
Kexin-Tang / CS-Zoo
View on GitHub
计算机相关知识笔记
☆10Jun 21, 2026Updated last month
nini0919 / DiffPNG
View on GitHub
[ECCV2024]The official implementation of the DiffPNG paper in PyTorch.
☆17Oct 17, 2024Updated last year
Shujun-He / 3rd_Solution_Feedback_Prize_Evaluating_Student_Writing
View on GitHub
☆13Apr 16, 2022Updated 4 years ago
vivek231 / breast_tumor_segmentation
View on GitHub
Breast tumor segmentation and shape classification in mammograms using generative adversarial and convolutional neural network
☆13Jul 30, 2021Updated 4 years ago
dasayan05 / sketchode
View on GitHub
Code repo for "SketchODE: Learning neural sketch representation in continuous time" published in ICLR 2022
☆11Apr 19, 2022Updated 4 years ago
upura / commonlitreadabilityprize
View on GitHub
☆10Aug 21, 2021Updated 4 years ago
HiBugs / 3DFaceReconstruction_ForExpression
View on GitHub
3d face reconstruction, expression. 3D人脸重建出不同面部表情。
☆10Aug 29, 2020Updated 5 years ago
vkverma01 / EFT
View on GitHub
Class Incremental learning, Task Incremental Learning
☆17Dec 19, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
LydiaXiaohongLi / Megatron-DeepSpeed
View on GitHub
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆19Jul 20, 2023Updated 3 years ago
NiaBie / FreeLive
View on GitHub
Managed L2D tool libs. (In Dev)
☆14Apr 20, 2019Updated 7 years ago
liruilong940607 / mint
View on GitHub
Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.
☆12Mar 2, 2022Updated 4 years ago
ht014 / SG2HOI
View on GitHub
☆12Sep 19, 2021Updated 4 years ago
Zacchaeus00 / nbme
View on GitHub
https://www.kaggle.com/c/nbme-score-clinical-patient-notes
☆10Sep 1, 2022Updated 3 years ago
Res2Net / Res2Net-ImageNet-Training
View on GitHub
ImageNet training code of Res2Net
☆16Nov 2, 2020Updated 5 years ago
neo85824 / epsnet
View on GitHub
Implementation of EPSNet for panoptic segmentation on PyTorch
☆14Sep 5, 2020Updated 5 years ago