cqels / vision
☆19 Updated 7 months ago
Alternatives and similar repositories for vision
Users that are interested in vision are comparing it to the libraries listed below
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects". ☆111 Updated 2 years ago
- [CVPR 2023 Highlight] Beyond mAP: Towards better evaluation of instance segmentation ☆27 Updated 2 years ago
- ☆26 Updated 3 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark ☆54 Updated 2 years ago
- ☆13 Updated 3 years ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L… ☆48 Updated last week
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024 ☆104 Updated last year
- ☆32 Updated 3 years ago
- ScrollNet for Continual Learning ☆11 Updated last year
- Python Tools for Visual Dataset Transformation ☆28 Updated last month
- Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification. ECCV 2022. ☆18 Updated 3 years ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning ☆36 Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆36 Updated last year
- Code for the Ask4Help project ☆22 Updated 2 years ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024) ☆36 Updated last year
- ☆34 Updated 2 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆63 Updated 2 years ago
- Official implementation for Sparse MetA-Tuning (SMAT) ☆18 Updated last month
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆101 Updated 11 months ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering" ☆19 Updated 2 years ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task ☆45 Updated 8 months ago
- ☆70 Updated last month
- ☆26 Updated last year
- ☆24 Updated 2 years ago
- Directed masked autoencoders ☆14 Updated 2 years ago
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation" ☆30 Updated 4 months ago
- [CVPR'23 Highlight] Heterogeneous Continual Learning. ☆16 Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆57 Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch ☆103 Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta ☆16 Updated 9 months ago