OliverRensu / D-iGPTView external linksLinks
[ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Learners"
☆98May 3, 2024Updated last year
Alternatives and similar repositories for D-iGPT
Users that are interested in D-iGPT are comparing it to the libraries listed below
Sorting:
- ☆59Jun 18, 2024Updated last year
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆55Aug 27, 2025Updated 5 months ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆90May 30, 2025Updated 8 months ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated 10 months ago
- A Contrastive Learning Boost from Intermediate Pre-Trained Representations☆43Sep 19, 2024Updated last year
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆320Jun 3, 2024Updated last year
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 2 months ago
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆15Apr 23, 2024Updated last year
- [CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"☆211Jun 9, 2024Updated last year
- PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024.☆15Jun 4, 2024Updated last year
- ☆73May 10, 2024Updated last year
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Oct 23, 2023Updated 2 years ago
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,397Aug 4, 2025Updated 6 months ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks☆390Jul 9, 2024Updated last year
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆36Apr 21, 2024Updated last year
- [NeurIPS 2025] Completeness-Aware Reconstruction Enhancement☆35Oct 18, 2025Updated 3 months ago
- i-mae Pytorch Repo☆20Apr 6, 2024Updated last year
- This is the official code release for our work, Denoising Vision Transformers.☆393Nov 13, 2024Updated last year
- The official implementation of "CateNorm: Categorical Normalization for Robust Medical Image Segmentation"☆32Sep 30, 2022Updated 3 years ago
- [IEEE TMI] Tumor synthesis leveraging medical reports.☆48Jan 26, 2026Updated 3 weeks ago
- iBOT : Image BERT Pre-Training with Online Tokenizer (ICLR 2022)☆764Apr 14, 2022Updated 3 years ago
- [ISBI 2023] Official Implementation for Label-Assemble☆20Jul 30, 2024Updated last year
- ☆24Jun 17, 2025Updated 7 months ago
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆14Mar 19, 2025Updated 10 months ago
- ☆11Feb 28, 2024Updated last year
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆497Nov 20, 2025Updated 2 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆129Aug 21, 2024Updated last year
- [TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.☆232Dec 22, 2023Updated 2 years ago
- [TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆42Apr 30, 2024Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers☆993Nov 25, 2025Updated 2 months ago
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations☆46Sep 8, 2025Updated 5 months ago
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆151Jul 24, 2025Updated 6 months ago
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed☆26Jul 26, 2025Updated 6 months ago
- Improving transparency of large language models' reasoning☆14Nov 25, 2025Updated 2 months ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆244Oct 12, 2025Updated 4 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆348Dec 1, 2025Updated 2 months ago
- [ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning☆48Dec 21, 2025Updated last month