ViT models pretrained with up to ~5k hours of human-like video data
☆14Aug 10, 2023Updated 2 years ago
Alternatives and similar repositories for humanlike-vits
Users that are interested in humanlike-vits are comparing it to the libraries listed below
Sorting:
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…☆20Jun 3, 2024Updated last year
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- ☆19Apr 28, 2023Updated 2 years ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Dec 27, 2024Updated last year
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25May 16, 2024Updated last year
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆32Aug 22, 2023Updated 2 years ago
- Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23☆30Dec 30, 2024Updated last year
- Repository related to Cranfield's AAI MSCs GDP☆11Apr 8, 2023Updated 2 years ago
- ☆10Updated this week
- ☆12Mar 25, 2025Updated 11 months ago
- Beyond Accuracy: What Matters in Designing Well-Behaved Models?☆18Updated this week
- ☆11Mar 11, 2024Updated last year
- ☆10Feb 10, 2026Updated 3 weeks ago
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- ☆41Sep 25, 2023Updated 2 years ago
- Pytorch implementation of Visual DNA, an approach to represent and compare images.☆40Feb 14, 2024Updated 2 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- P1AC: Revisiting Absolute Pose From a Single Affine Correspondence☆11Mar 19, 2024Updated last year
- ☆12Jun 26, 2024Updated last year
- [Advanced Photonics Research, 2021] Control tightly focused fields via manipulating pupil functions☆10Dec 25, 2024Updated last year
- Visual Concept Connectome☆15Jun 23, 2024Updated last year
- Data Programming for Text Detection in Documents using SPEAR☆12Mar 26, 2025Updated 11 months ago
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- Open Data and sources for OSINT in Tajikistan☆13Jan 17, 2025Updated last year
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables"☆10Apr 7, 2020Updated 5 years ago
- [ACM MM 2025] LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs☆15Feb 10, 2026Updated 3 weeks ago
- LED : Light Enhanced Depth Estimation at Night☆14Dec 9, 2025Updated 2 months ago
- ☆10Updated this week
- ☆41Mar 27, 2024Updated last year
- ☆10Feb 23, 2017Updated 9 years ago
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆11Dec 30, 2024Updated last year
- ☆10Nov 15, 2023Updated 2 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- ☆10Sep 5, 2024Updated last year
- Official implementation of SGDiff (ACM MM '23)☆37Nov 26, 2023Updated 2 years ago
- Image Manipulation Detection and Localization☆10Aug 10, 2023Updated 2 years ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- [EMNLP 2022] Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive Learning☆14Jun 22, 2024Updated last year