Image Captioning Vision Transformers (ViTs) are transformer models that generate descriptive captions for images by combining the power of Transformers and computer vision. It leverages state-of-the-art pre-trained ViT models and employs technique
☆41Oct 14, 2024Updated last year
Alternatives and similar repositories for Image-captioning-ViT
Users that are interested in Image-captioning-ViT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14May 7, 2024Updated 2 years ago
- In todays era, due to the surge in the usage of internet and other online platforms, security has been a major concern. Many cyber attack…☆11Apr 11, 2021Updated 5 years ago
- Implementation of the CPTR model by https://arxiv.org/pdf/2101.10804.pdf☆10Mar 27, 2022Updated 4 years ago
- ☆15Apr 9, 2024Updated 2 years ago
- ☆12Jul 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于FastAPI + LangChain + OpenAI API + Vue的AI表格处理工具,用于智能化处理和分析表格数据。☆20Jul 14, 2025Updated 11 months ago
- URL phishing detection using Generative Adversarial Network (GAN)☆14Oct 14, 2022Updated 3 years ago
- 清华大学人工智能导论(龙明盛老师)课程课件,作业以及试题☆17Jun 26, 2023Updated 2 years ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆26May 21, 2026Updated 3 weeks ago
- This project aims at filtering out the human vocals of songs using a library Spleeter, and turning them into instrumental versions, creat…☆12Jul 18, 2021Updated 4 years ago
- The official implementation of Bayesian Cross-modal Alignment Learning for Few-Shot Out-of-Distribution Generalization (AAAI2023).☆12Oct 13, 2025Updated 8 months ago
- Web App Capable of Predicting Next Word Using BERT☆15Dec 8, 2022Updated 3 years ago
- Power Platform Connectors snippets☆11Aug 11, 2022Updated 3 years ago
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Jan 22, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A large-scale benchmark for the evaluation of embeddings across a number of fine-grained and instance-level visual domains.☆17Jun 14, 2024Updated 2 years ago
- A tiny server to run local inference on MLX model in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago
- ☆11Aug 22, 2023Updated 2 years ago
- ☆13Dec 3, 2021Updated 4 years ago
- The torchosr module is a set of tools for Open Set Recognition in Python, compatible with PyTorch library.☆13Mar 11, 2025Updated last year
- [CVPR 2023] Code for the paper "Masked Images Are Counterfactual Samples for Robust Fine-tuning"☆14Mar 24, 2023Updated 3 years ago
- ☆16May 24, 2024Updated 2 years ago
- Power Apps Service Desk template Fixed☆14Dec 22, 2024Updated last year
- ☆15Aug 30, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The source code of [WWW 2025] MoDiCF☆14Mar 26, 2026Updated 2 months ago
- Animated text for your next Power Apps project!☆11Mar 9, 2023Updated 3 years ago
- Model calibration in CLIP Adapters☆20Aug 19, 2024Updated last year
- This repo host Power Apps Canvas YAML pieces of code, and serves as a gallery of canvas building blocks for the Power Apps community.☆18Apr 17, 2026Updated last month
- Get up in the morning by striking a pose to stop your alarm from ringing.☆12Jun 9, 2021Updated 5 years ago
- ☆12Jan 10, 2023Updated 3 years ago
- ☆37Jan 5, 2018Updated 8 years ago
- Welcome to Power BI Embedded Step by Step Series. Using this GitHub Repository you can download complete solution.☆15Jan 2, 2021Updated 5 years ago
- Image Captioning Using Transformer☆270Jun 23, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Using a CNN-LSTM hybrid network to generate captions for images☆18Nov 19, 2019Updated 6 years ago
- ☆26Apr 27, 2023Updated 3 years ago
- Animated dark/light mode toggle for your #PowerApps☆10Jul 3, 2023Updated 2 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 9 months ago
- Python command line tools as productivity supplements for Posix systems☆17Apr 4, 2024Updated 2 years ago
- ☆11Nov 18, 2024Updated last year
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with …☆27May 20, 2025Updated last year