hkust-vgd / MarineGPTLinks
The official implementation of MarineGPT
☆36Updated last year
Alternatives and similar repositories for MarineGPT
Users that are interested in MarineGPT are comparing it to the libraries listed below
Sorting:
- Official repository for the General Robust Image Task (GRIT) Benchmark☆54Updated 2 years ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated last year
- ☆30Updated 2 years ago
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"☆52Updated 2 years ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated 2 years ago
- ☆15Updated 3 years ago
- Vision-oriented multimodal AI☆49Updated last year
- Connecting segment-anything's output masks with the CLIP model; Awesome-Segment-Anything-Works☆202Updated last year
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆111Updated 2 years ago
- Open-vocabulary Semantic Segmentation☆33Updated last year
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆34Updated last year
- Segment Anything Is Not Always Perfect: An Investigation of SAM on Different Real-World Application☆32Updated 11 months ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆57Updated last year
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)☆32Updated 2 years ago
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Updated 2 years ago
- Official PyTorch code for HILA☆28Updated 3 years ago
- CV′3315 Is All You Need – Semantic Segmentation Course Competition @ The University of Adelaide☆22Updated 2 years ago
- (ICLR 2024, CVPR 2024) SparseFormer☆75Updated 11 months ago
- [CVPR 2023] An official Pytorch implementation of "Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers".☆42Updated 10 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆48Updated last year
- Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"☆26Updated last year
- Image Instance Segmentation - Zero Shot - OpenAI's CLIP + Meta's SAM☆72Updated 2 years ago
- Entry to the 2023 Scroll Prize☆44Updated 2 years ago
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆85Updated 2 years ago
- Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".☆58Updated 2 years ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated last year
- Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).☆83Updated 2 years ago
- (ECCV 2024) Can OOD Object Detectors Learn from Foundation Models?☆25Updated 11 months ago
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated last year
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆20Updated 10 months ago