cqels / vision
☆19 · Updated 3 months ago
Alternatives and similar repositories for vision
Users who are interested in vision are comparing it to the libraries listed below.
- SSL Video Representation Learning project ☆14 · Updated 6 months ago
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects". ☆111 · Updated 2 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark ☆54 · Updated 2 years ago
- [CVPR 2023 Highlight] Beyond mAP: Towards better evaluation of instance segmentation ☆27 · Updated 2 years ago
- Code for paper Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training ☆22 · Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆37 · Updated 2 years ago
- ☆83 · Updated 2 years ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task ☆48 · Updated last year
- ☆25 · Updated 2 years ago
- Experimental scripts for researching data adaptive learning rate scheduling. ☆22 · Updated 2 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta ☆16 · Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆102 · Updated last year
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in PyTorch ☆39 · Updated 3 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in PyTorch ☆103 · Updated 2 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance” ☆61 · Updated 2 years ago
- Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scra… ☆54 · Updated 2 years ago
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 · Updated last year
- ☆37 · Updated 2 years ago
- Python Tools for Visual Dataset Transformation ☆29 · Updated 4 months ago
- Vision-oriented multimodal AI ☆49 · Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆58 · Updated last year
- [CVPR'23 Highlight] Heterogeneous Continual Learning. ☆15 · Updated 2 years ago
- Official implementation for Sparse MetA-Tuning (SMAT) ☆17 · Updated 6 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L… ☆55 · Updated 5 months ago
- ☆32 · Updated 3 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo… ☆15 · Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering" ☆19 · Updated 3 years ago
- ☆43 · Updated last year
- ☆26 · Updated 3 years ago
- ☆80 · Updated last year