Recent Advances in Vision-Language Pre-training!
☆32Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for awesome-vision-language-modeling
Users that are interested in awesome-vision-language-modeling are comparing it to the libraries listed below.
- Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein Localization Prediction☆29Oct 1, 2023Updated 2 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 4 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 3 years ago
- Based on the MIL method☆13Jul 25, 2024Updated last year
- Code for "From Pixel to Patch: Synthesize Context-aware Features for Zero-shot Semantic Segmentation".☆36Jan 19, 2022Updated 4 years ago
- This is the public repo for the course HMMA238 'Software Development'☆11Apr 20, 2021Updated 5 years ago
- ☆11Jun 4, 2021Updated 5 years ago
- PFA-ScanNet: Pyramidal Feature Aggregation With Synergistic Learning for Breast Cancer Metastasis Analysis (Architecture Only Pytorch Imp…☆21Aug 8, 2019Updated 6 years ago
- A novel physical adversarial attack tackling the Digital-to-Physical Visual Inconsistency problem.☆12Feb 5, 2025Updated last year
- This is the official released code for our paper, The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos, which has bee…☆53Apr 14, 2023Updated 3 years ago
- Remote sensing Image Captioning is a special case of Image Captioning which solves the difficulties in processing the remote sensing imag…☆12Jun 16, 2021Updated 5 years ago
- 3D_Coronary_Artery_Segmentation☆12Feb 23, 2022Updated 4 years ago
- Contains tools for generalized Procrustes analysis, active shape models and shape-based image warping☆10Jun 28, 2014Updated 12 years ago
- ☆65Oct 11, 2023Updated 2 years ago
- ☆12Jul 11, 2022Updated 3 years ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆19Feb 1, 2026Updated 4 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated 2 years ago
- Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)☆43Jul 31, 2022Updated 3 years ago
- ☆14Feb 17, 2023Updated 3 years ago
- pytorch implementation of XMC-GAN☆11Jun 2, 2021Updated 5 years ago
- An automated data pipeline scaling RL to pretraining levels☆77Jun 2, 2026Updated 3 weeks ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆16Nov 20, 2025Updated 7 months ago
- ☆11Nov 10, 2023Updated 2 years ago
- [ECCVW/TWYN 2024 - Best Workshop Paper] Are CLIP features all you need for Universal Synthetic Image Origin Attribution?☆13Mar 27, 2026Updated 3 months ago
- ☆12May 3, 2024Updated 2 years ago
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆14Feb 12, 2026Updated 4 months ago
- Simple PyTorch Dataset for the EPIC-Kitchens-55 and EPIC-Kitchens-100 that handles frames and features (rgb, optical flow, and objects) f…☆24Jan 22, 2023Updated 3 years ago
- Login script☆12Nov 4, 2022Updated 3 years ago
- 🎵 Partnership with AI to create Beats☆11Oct 13, 2020Updated 5 years ago
- A neural network software for using Molecular labelling to improve pathological annotation of H and E tissues☆32Jan 29, 2025Updated last year
- Code to compute (and backpropagate) FID for a mini-batch of fake samples.☆16Oct 13, 2021Updated 4 years ago
- [MICCAI-2023]Visual-Attribute Prompt Learning for Progressive Mild Cognitive Impairment Prediction☆15Dec 12, 2023Updated 2 years ago
- [ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding☆14Oct 2, 2024Updated last year
- ☆14May 6, 2021Updated 5 years ago
- Semantic Projection Network for Zero- and Few-Label Semantic Segmentation (CVPR 2019)☆52Sep 25, 2019Updated 6 years ago
- Supervised Training of Conditional Monge Maps☆19Oct 30, 2023Updated 2 years ago
- Recaptured Screen Image Demoiréing. (TCSVT 2020)☆26Apr 8, 2021Updated 5 years ago
- Multi-Scale Representation Attention based Deep Multiple Instance Learning for Gigapixel Whole Slide Image Analysis☆26Aug 26, 2023Updated 2 years ago
- ☆15Feb 24, 2023Updated 3 years ago