cqels / vision
☆19Updated last month
Alternatives and similar repositories for vision:
Users that are interested in vision are comparing it to the libraries listed below
- Python Tools for Visual Dataset Transformation☆26Updated last week
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 4 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 2 weeks ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆59Updated 4 months ago
- ☆24Updated last year
- ☆26Updated 3 years ago
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024)☆32Updated last year
- ☆12Updated 7 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆32Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆35Updated last year
- Official code repository for the WACV 2022 paper "Visualizing Paired Image Similarity in Transformer Networks"☆22Updated 2 years ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆35Updated 4 months ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Updated 3 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆14Updated 4 months ago
- Code for paper Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training☆22Updated last year
- ☆18Updated last month
- ScrollNet for Continual Learning☆11Updated last year
- Directed masked autoencoders☆14Updated 2 years ago
- Official Code for MIMETIC^2☆12Updated 4 months ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆37Updated 3 years ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆42Updated 2 months ago
- ☆30Updated 2 years ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 3 months ago
- Lottery Ticket Adaptation☆38Updated 4 months ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- [CVPR'23 Highlight] Heterogeneous Continual Learning.☆16Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated last year
- ☆16Updated 2 years ago
- ☆13Updated 2 years ago