Imageomics / bioclip
This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].
☆193 Updated 2 weeks ago
Alternatives and similar repositories for bioclip:
Users who are interested in bioclip are comparing it to the repositories listed below.
- [CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloadin… ☆217 Updated 5 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings" ☆122 Updated 6 months ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral) ☆114 Updated 11 months ago
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight) ☆174 Updated 2 months ago
- Connecting segment-anything's output masks with the CLIP model; Awesome-Segment-Anything-Works ☆188 Updated 5 months ago
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti… ☆309 Updated last year
- ☆200 Updated last year
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143) ☆158 Updated last year
- Official repository of paper "Subobject-level Image Tokenization" ☆65 Updated 10 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆97 Updated 10 months ago
- Python package that simplifies using the BioCLIP foundation model. ☆26 Updated this week
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des… ☆55 Updated 8 months ago
- [CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era" ☆198 Updated 9 months ago
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model" ☆230 Updated 2 months ago
- This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification. ☆48 Updated 11 months ago
- [ECCV'24] Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance ☆125 Updated 6 months ago
- [NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation" ☆282 Updated 11 months ago
- When do we not need larger vision models? ☆376 Updated last month
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆101 Updated 3 months ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation ☆120 Updated 5 months ago
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆43 Updated last week
- This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation. ☆36 Updated 3 months ago
- [CVPR 2024] ViT-Lens: Towards Omni-modal Representations ☆171 Updated last month
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference ☆147 Updated 5 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆101 Updated 6 months ago
- Open source implementation of "Vision Transformers Need Registers" ☆168 Updated last month
- [CVPR 2024] Code release for "Unsupervised Universal Image Segmentation" ☆192 Updated 10 months ago
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding ☆164 Updated last month