Validating image classification benchmark results on ViTs and ResNets (v2)
☆13Nov 3, 2022Updated 3 years ago
Alternatives and similar repositories for transformers-transforming-vision
Users that are interested in transformers-transforming-vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"…☆52Nov 14, 2023Updated 2 years ago
- [⭐ CVPR 2025 Highlight ⭐] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing fro…☆31Apr 22, 2025Updated last year
- Code for "Out-of-Distribution Detection using Synthetic Data Generation"☆20Feb 6, 2025Updated last year
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆308Apr 3, 2024Updated 2 years ago
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology☆12Jun 17, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆26Aug 31, 2023Updated 2 years ago
- ☆11Oct 29, 2024Updated last year
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".☆12Oct 11, 2024Updated last year
- ☆26Mar 13, 2026Updated 3 months ago
- ML model trained on data from Bayut.com to predict housing prices in Dubai☆17Aug 21, 2025Updated 9 months ago
- (ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.☆18Sep 28, 2023Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆102Apr 30, 2024Updated 2 years ago
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" …☆35Jan 8, 2023Updated 3 years ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆85May 18, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆15Jul 24, 2022Updated 3 years ago
- My thesis project for person re-identification☆14Dec 8, 2022Updated 3 years ago
- [CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos☆104Apr 14, 2025Updated last year
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆821Jul 24, 2023Updated 2 years ago
- Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)☆183Aug 9, 2022Updated 3 years ago
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆79Sep 24, 2024Updated last year
- ☆41Sep 21, 2023Updated 2 years ago
- [CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection☆31Jun 21, 2023Updated 2 years ago
- ☆36Feb 5, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Source code for MICCAI 2022 paper entitled: 'Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification'☆36Jan 13, 2023Updated 3 years ago
- [ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".☆315May 9, 2023Updated 3 years ago
- [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Ap…☆416Jul 25, 2023Updated 2 years ago
- ☆42Nov 9, 2023Updated 2 years ago
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆22Dec 31, 2025Updated 5 months ago
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- ☆10May 14, 2023Updated 3 years ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆107Feb 11, 2024Updated 2 years ago
- ☆17May 20, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆128Dec 17, 2024Updated last year
- Pytorch implementation of the StarNet paper algorithm☆10Jan 25, 2022Updated 4 years ago
- A method to generate counterfactuals☆12Feb 24, 2026Updated 3 months ago
- Offline handwritten mathematical expression recognition via stroke extraction for Android☆14Sep 7, 2020Updated 5 years ago
- Handwritten Math Expressions Recognition☆13Sep 8, 2017Updated 8 years ago
- Official repository of "A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning" published in NeurIPS'20…☆12Feb 7, 2025Updated last year
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…☆46May 26, 2025Updated last year