inuwamobarak / Image-captioning-ViT
Image Captioning Vision Transformers (ViTs) are transformer models that generate descriptive captions for images by combining the power of Transformers and computer vision. It leverages state-of-the-art pre-trained ViT models and employs technique
☆39 · Oct 14, 2024 · Updated last year
Alternatives and similar repositories for Image-captioning-ViT
Users that are interested in Image-captioning-ViT are comparing it to the libraries listed below
- ☆10 · Dec 25, 2024 · Updated last year
- ☆12 · Nov 21, 2025 · Updated 2 months ago
- Power Apps Service Desk template Fixed ☆13 · Dec 22, 2024 · Updated last year
- This repo host Power Apps Canvas YAML pieces of code, and serves as a gallery of canvas building blocks for the Power Apps community. ☆15 · May 26, 2025 · Updated 8 months ago
- Advanced Analytics data collection for M365 usage ☆19 · Jan 29, 2026 · Updated 2 weeks ago
- Power Platform Connectors snippets ☆11 · Aug 11, 2022 · Updated 3 years ago
- Open source for SiTunes, a situational music recommendation feedback dataset that includes physiological, psychological, and environmenta… ☆10 · Mar 15, 2024 · Updated last year
- ☆11 · Oct 22, 2023 · Updated 2 years ago
- ☆13 · Nov 12, 2025 · Updated 3 months ago
- ☆10 · Dec 10, 2023 · Updated 2 years ago
- A Power BI template that generates a Process Behaviour Chart (PBC) to visualize the variability and predictability of agile teams using Jira … ☆13 · Dec 19, 2025 · Updated last month
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with … ☆23 · May 20, 2025 · Updated 8 months ago
- Kohya's GUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experie… ☆18 · Nov 15, 2024 · Updated last year
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.… ☆11 · May 16, 2024 · Updated last year
- 2nd Place Solution - Kaggle Challenge: Learning Equality - Curriculum Recommendations ☆13 · Mar 28, 2023 · Updated 2 years ago
- Voiceflow Wordpress plugin ☆14 · Nov 17, 2023 · Updated 2 years ago
- ☆10 · Nov 18, 2024 · Updated last year
- Dark mode menu with neon glow effect! ☆11 · Sep 18, 2022 · Updated 3 years ago
- ☆10 · Jul 29, 2024 · Updated last year
- Collection of useful scripts for RunPod pods ☆15 · Jan 26, 2024 · Updated 2 years ago
- Modern Data Grid component for Canvas apps ☆10 · Nov 20, 2024 · Updated last year
- In this repository, you can find the resources from the session by Cathrine Bruvold and Daniel Laskewitz. ☆21 · Dec 9, 2025 · Updated 2 months ago
- Get up in the morning by striking a pose to stop your alarm from ringing. ☆12 · Jun 9, 2021 · Updated 4 years ago
- ☆11 · Dec 20, 2025 · Updated last month
- Claymorphic Mobile Navigation Menu for your Canvas Apps! ☆11 · Sep 18, 2022 · Updated 3 years ago
- [ICASSP'2025] "M³Rec: Selective State Space Models with Mixture-of-Modality Experts for Multi-Modal Sequential Recommendation" ☆11 · Jul 9, 2025 · Updated 7 months ago
- Tiled samplers for ComfyUI ☆12 · Nov 27, 2024 · Updated last year
- A tiny server to run local inference on MLX model in the style of OpenAI ☆13 · Jan 31, 2024 · Updated 2 years ago
- core for Modeling Dual Period-Varying Preferences for Takeaway Recommendation ☆12 · Dec 12, 2023 · Updated 2 years ago
- GiMeFive: Towards Interpretable Facial Emotion Classification 😄😲😭😡🤢😨 (PyTorch Implementation) ☆16 · Jul 6, 2024 · Updated last year
- ☆12 · Jan 10, 2023 · Updated 3 years ago
- Code/Notebook used for Machine Learning/Deep Learning or Programming Talks ☆13 · Jul 25, 2024 · Updated last year
- Codes for TOIS Paper: Efficient On-Device Session-Based Recommendation ☆12 · May 25, 2023 · Updated 2 years ago
- Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training. ☆15 · Sep 18, 2024 · Updated last year
- The source code for the paper "Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation, ACM TOIS 2026". ☆13 · Dec 20, 2025 · Updated last month
- A simple sidebar for your ComfyUI! ☆12 · Mar 21, 2024 · Updated last year
- ☆15 · Dec 20, 2024 · Updated last year
- KDD2025 | Multi-granularity Interest Retrieval and Refinement Network for Long-Term User Behavior Modeling in CTR Prediction ☆15 · Feb 17, 2025 · Updated 11 months ago
- A C++11 / GLSL library for enhancing photographs (adjusting brightness, contrast, etc.) ☆13 · Feb 20, 2022 · Updated 3 years ago