Image Captioning Vision Transformers (ViTs) are transformer models that generate descriptive captions for images by combining the power of Transformers and computer vision. This project leverages state-of-the-art pre-trained ViT models and employs technique
☆40 · Oct 14, 2024 · Updated last year
Alternatives and similar repositories for Image-captioning-ViT
Users that are interested in Image-captioning-ViT are comparing it to the libraries listed below.
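- A generative recommendation project based on the movielens-25m dataset ☆36 · Aug 6, 2025 · Updated 7 months ago
- ☆12 · Jul 20, 2024 · Updated last year
- How to train a large language model for dialects ☆27 · Dec 5, 2024 · Updated last year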
- ☆17 · May 31, 2023 · Updated 2 years ago
- Improving neural network representations using human similarity judgments ☆13 · Nov 22, 2024 · Updated last year
- ☆13 · Nov 21, 2025 · Updated 4 months ago
- The official implementation of Bayesian Cross-modal Alignment Learning for Few-Shot Out-of-Distribution Generalization (AAAI2023). ☆20 · Oct 13, 2025 · Updated 5 months ago
- the answer of cs231n assignment 123 with resolution ☆16 · Aug 28, 2023 · Updated 2 years ago
- [IJCV2025] https://arxiv.org/abs/2304.04521 ☆15 · Jan 22, 2025 · Updated last year
- A large-scale benchmark for the evaluation of embeddings across a number of fine-grained and instance-level visual domains. ☆17 · Jun 14, 2024 · Updated last year
- ☆11 · Aug 22, 2023 · Updated 2 years ago
- ☆15 · Mar 9, 2023 · Updated 3 years ago
- Code of GraphAdapter ☆17 · Mar 21, 2024 · Updated 2 years ago
- Advanced Analytics data collection for M365 usage ☆20 · Mar 9, 2026 · Updated 2 weeks ago
- Power Apps Service Desk template Fixed ☆13 · Dec 22, 2024 · Updated last year
- The source code of [WWW 2025] MoDiCF ☆12 · Jul 12, 2025 · Updated 8 months ago
- ☆10 · Dec 25, 2024 · Updated last year
- Animated text for your next Power Apps project! ☆11 · Mar 9, 2023 · Updated 3 years ago
- Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training. ☆16 · Sep 18, 2024 · Updated last year
- ☆12 · Jan 10, 2023 · Updated 3 years ago
- Welcome to Power BI Embedded Step by Step Series. Using this GitHub Repository you can download complete solution. ☆15 · Jan 2, 2021 · Updated 5 years ago
- Prompt Tuning based Adapter for Vision-Language Model Adaption ☆16 · Sep 1, 2023 · Updated 2 years ago
- Animated dark/light mode toggle for your #PowerApps ☆10 · Jul 3, 2023 · Updated 2 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning ☆12 · Aug 23, 2025 · Updated 7 months ago
- Python command line tools as productivity supplements for Posix systems ☆17 · Apr 4, 2024 · Updated last year
- ☆10 · Nov 18, 2024 · Updated last year
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc. ☆88 · Mar 20, 2024 · Updated 2 years ago
- In this repository, you can find the resources from the session by Cathrine Bruvold and Daniel Laskewitz. ☆22 · Dec 9, 2025 · Updated 3 months ago
- Claymorphic Mobile Navigation Menu for your Canvas Apps! ☆11 · Sep 18, 2022 · Updated 3 years ago
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with … ☆23 · May 20, 2025 · Updated 10 months ago
- Modern Data Grid component for Canvas apps ☆10 · Nov 20, 2024 · Updated last year
- Blog of the LibreCV.org ☆11 · May 17, 2021 · Updated 4 years ago
- ☆15 · Dec 20, 2024 · Updated last year
- Simple repository contribution statistics ☆15 · Mar 6, 2026 · Updated 2 weeks ago
- A framework for studying the best practises for Mahalanobis distance for OOD detection ☆19 · Dec 5, 2024 · Updated last year
- ☆10 · Dec 10, 2023 · Updated 2 years ago
- A small collection of custom nodes for use with ComfyUI, for geometry calculations ☆13 · Sep 30, 2024 · Updated last year
- ChatBot App built using LangChain and Lightning AI ☆17 · Mar 4, 2023 · Updated 3 years ago
- Materials for "Transformers from the Ground Up" at PyData Jeddah on August 5, 2021 ☆20 · Aug 5, 2021 · Updated 4 years ago