Evaluation tools for image captioning, including BLEU, ROUGE-L, CIDEr, METEOR, and SPICE scores.
☆34Feb 17, 2023Updated 3 years ago
Alternatives and similar repositories for bleu-rouge-meteor-cider-spice-eval4imagecaption
Users who are interested in bleu-rouge-meteor-cider-spice-eval4imagecaption are comparing it to the libraries listed below.
- [ACL2023] Official code repository for VLN-Trans☆14Sep 10, 2023Updated 2 years ago
- Gradio demo used in our Osprey: Pixel Understanding with Visual Instruction Tuning.☆16Dec 19, 2023Updated 2 years ago
- ☆12Jun 21, 2022Updated 3 years ago
- ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions☆10May 17, 2024Updated last year
- Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024)☆16Jun 7, 2024Updated last year
- ☆13May 21, 2024Updated last year
- Demonstrates iterative FGSM on Apple's NeuralHash model.☆16Aug 19, 2021Updated 4 years ago
- Vehicle registration plate recognition using convolutional neural networks☆11Nov 30, 2022Updated 3 years ago
- ☆22Jun 30, 2023Updated 2 years ago
- ☆15Jul 9, 2024Updated last year
- ☆11Jul 11, 2023Updated 2 years ago
- Implementation based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- Reversi AI based on Monte Carlo search algorithm☆10Apr 2, 2025Updated 11 months ago
- ☆10Jun 22, 2021Updated 4 years ago
- ☆15Jan 15, 2021Updated 5 years ago
- This code was used to collect, process, and validate the REFLACX (Reports and Eye-Tracking Data for Localization of Abnormalities in Ches…☆18Apr 6, 2022Updated 3 years ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆27Oct 20, 2025Updated 5 months ago
- Huazhong University of Science and Technology network security course project: a stateful inspection firewall on Linux☆11Oct 17, 2022Updated 3 years ago
- A library for training crosscoders☆16May 28, 2025Updated 9 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- ☆13Nov 23, 2019Updated 6 years ago
- Kakao Mobility MCP Server for directions and transit information☆10Sep 14, 2025Updated 6 months ago
- CORE-ReID: Comprehensive Optimization and Refinement through Ensemble fusion in Domain Adaptation for person re-identification☆15May 7, 2025Updated 10 months ago
- ☆19Sep 19, 2022Updated 3 years ago
- Triangles in practice! Triton☆16Feb 15, 2024Updated 2 years ago
- Large language model of Medical AI, General Medical AI (GMAI)☆17Jan 30, 2024Updated 2 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 2 months ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆19May 11, 2019Updated 6 years ago
- ☆10Oct 26, 2022Updated 3 years ago
- Type-Specific Adversarial Attack for Object Detection☆13Aug 27, 2021Updated 4 years ago
- A curated list of resources on Document Layout Analysis☆11Aug 7, 2025Updated 7 months ago
- Repository for Chinese image captioning☆28Dec 3, 2017Updated 8 years ago
- Trying to classify the 20BN-JESTER hand gesture data set using a few architectures.☆17May 8, 2018Updated 7 years ago
- A quick way to get started with Transformer Lens☆14Dec 13, 2023Updated 2 years ago
- An unofficial implementation of the SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS), with experiment details.☆14Mar 20, 2024Updated 2 years ago
- Notes on virtual try-on papers, covering some classic papers in the area.☆11Jan 3, 2025Updated last year
- ☆10Oct 17, 2021Updated 4 years ago
- Code repository for Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering☆19Oct 30, 2021Updated 4 years ago
- The official implementation for Candidate Set Re-ranking for Composed Image Retrieval (TMLR) 01/2024☆20Feb 7, 2024Updated 2 years ago