ali-k-hesar / how-AI-Sees-Our-World
Vision Transformer (ViT) models, with their attention mechanisms, revolutionized computer vision. By merging Class Activation Map (CAM) and Gradient-weighted Class Activation Map (Grad-CAM) concepts with ViT's attention maps, we gain deeper insights into how these models perceive and analyze images.
☆13 Aug 14, 2023 Updated 2 years ago
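As a rough illustration of what combining Grad-CAM ideas with ViT attention maps can look like, here is a minimal NumPy sketch. It is not taken from the repository: the function name `grad_weighted_attention_map`, the tensor layout (CLS token first, then a square grid of patch tokens), and the random stand-in data are all assumptions made for this example. In practice the attention weights and their gradients with respect to a class score would come from a real model.

```python
import numpy as np


def grad_weighted_attention_map(attn, grad, grid_size):
    """Fuse one layer's attention with its gradients into a patch heatmap.

    Hypothetical helper, not the repository's API.
    attn, grad: arrays of shape (heads, tokens, tokens), where
    tokens = 1 + grid_size**2 and token 0 is assumed to be the CLS token.
    """
    # Grad-CAM-style weighting: keep attention that raises the class score.
    weighted = np.maximum(attn * grad, 0.0)
    # Average over heads to get a single (tokens, tokens) relevance matrix.
    fused = weighted.mean(axis=0)
    # Relevance of each patch token from the CLS token's point of view.
    heatmap = fused[0, 1:].reshape(grid_size, grid_size)
    # Min-max normalize to [0, 1]; return zeros if the map is constant.
    span = heatmap.max() - heatmap.min()
    if span > 0:
        return (heatmap - heatmap.min()) / span
    return np.zeros_like(heatmap)


# Stand-in data: 3 heads, a 4x4 patch grid plus one CLS token.
rng = np.random.default_rng(0)
heads, grid = 3, 4
tokens = 1 + grid * grid
logits = rng.normal(size=(heads, tokens, tokens))
attn = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)  # row-wise softmax
grad = rng.normal(size=(heads, tokens, tokens))  # placeholder for d(score)/d(attn)

heatmap = grad_weighted_attention_map(attn, grad, grid)
print(heatmap.shape)
```

The resulting grid can be upsampled to the input resolution and overlaid on the image, as is commonly done with CAM-style heatmaps.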
Alternatives and similar repositories for how-AI-Sees-Our-World
Users that are interested in how-AI-Sees-Our-World are comparing it to the libraries listed below
- MFC student information management system ☆11 Jan 15, 2021 Updated 5 years ago
- Simple implementation of Expected Gradients and Integrated Gradients in PyTorch ☆12 May 11, 2022 Updated 3 years ago
- A CIFAR-10 classification task; accuracy is about 87% (without pre-training). The network is CoAtNet (0-5, the full CoAtNet family), w… ☆10 Oct 1, 2023 Updated 2 years ago
- Code for the AAAI 2021 paper "Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition". ☆10 Nov 21, 2022 Updated 3 years ago
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025) ☆19 Jan 17, 2026 Updated last month
- ☆14 May 20, 2025 Updated 8 months ago
- Continual learning strategies (EWC, GEM) for the rotated MNIST dataset ☆12 Apr 6, 2020 Updated 5 years ago
- dMel: Speech Tokenization Made Simple ☆16 May 13, 2025 Updated 9 months ago
- ☆12 Mar 5, 2024 Updated last year
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation ☆23 Dec 30, 2025 Updated last month
- Semantic Image Segmentation - Severstal Steel Defect Detection Kaggle ☆10 Dec 9, 2020 Updated 5 years ago
- ☆20 Nov 21, 2025 Updated 2 months ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs". ☆13 Jan 25, 2025 Updated last year
- How to write an academic paper ☆11 Oct 20, 2022 Updated 3 years ago
- ☆13 Jul 28, 2024 Updated last year
- Official implementation of "NoiseAR: AutoRegressing Initial Noise Prior for Diffusion Models" ☆18 Jun 3, 2025 Updated 8 months ago
- Fine-grained Figure Skating dataset (FineFS) involves RGB videos and estimated skeleton data, providing rich annotations for multiple dow… ☆18 Sep 15, 2024 Updated last year
- [IEEE TVT] FII-CenterNet: an anchor-free detector with foreground attention for traffic object detection ☆13 Jun 11, 2021 Updated 4 years ago
- A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions ☆15 Jan 22, 2026 Updated 3 weeks ago
- ☆13 Oct 9, 2024 Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language… ☆14 Dec 16, 2024 Updated last year
- ☆14 Oct 17, 2023 Updated 2 years ago
- A PyTorch implementation of the UNet segmentation model on the VOC2012 dataset ☆10 Mar 1, 2023 Updated 2 years ago
- ☆11 Jun 9, 2023 Updated 2 years ago
- [NCA] Official implementation of the paper Motion2Language, Unsupervised learning of synchronized semantic motion segmentation ☆13 Sep 9, 2024 Updated last year
- [AAAI 2026] This repository is the official implementation of "ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided Alignment"… ☆26 Updated this week
- Code for data preparation and for training a Mask R-CNN model ☆11 May 8, 2021 Updated 4 years ago
- ☆13 Nov 20, 2023 Updated 2 years ago
- ☆15 Jan 22, 2024 Updated 2 years ago
- A modular implementation of product of experts VAEs for multimodal data ☆13 Nov 15, 2021 Updated 4 years ago
- Official implementation of "PAPR in Motion: Seamless Point-level 3D Scene Interpolation" ☆12 Nov 6, 2024 Updated last year
- Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning (CVPR 2025, pytorch co… ☆14 Sep 29, 2025 Updated 4 months ago
- [ACL 2025 🔥] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts ☆18 May 22, 2025 Updated 8 months ago
- The official PyTorch code for AAAI'23 Paper "Sparse Coding in a Dual Memory System for Lifelong Learning" ☆12 Feb 15, 2023 Updated 3 years ago
- Code for "Continual Learning of Object Instances", Implemented in PyTorch, https://arxiv.org/abs/2004.10862 ☆11 Jun 12, 2020 Updated 5 years ago
- [ICLR'25] The first benchmark aiming to evaluate whether LMMs can assist oracle bone inscription processing tasks ☆20 Mar 21, 2025 Updated 10 months ago
- Author's implementation of learning virtual chimeras by dynamic motion reassembly (SIGGRAPH Asia 2022 Technical Paper) ☆14 Feb 20, 2023 Updated 2 years ago
- Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding". ☆39 Jun 9, 2025 Updated 8 months ago
- The official PyTorch implementation of "MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion" in ECCV 2024. ☆19 Jul 6, 2025 Updated 7 months ago