kyegomez / Vit-RGTS
Open source implementation of "Vision Transformers Need Registers"
☆210 · Jan 31, 2026 · Updated last week
Alternatives and similar repositories for Vit-RGTS
Users who are interested in Vit-RGTS are comparing it to the libraries listed below.
- A simple reproducible template to implement AI research papers ☆24 · Sep 9, 2024 · Updated last year
- ☆80 · Feb 27, 2025 · Updated 11 months ago
- ☆25 · Nov 22, 2024 · Updated last year
- The open source implementation of "NeVA: NeMo Vision and Language Assistant" ☆17 · Aug 26, 2023 · Updated 2 years ago
- The open source implementation of the multi grouped query attention from the paper "GQA: Training Generalized Multi-Query Transformer Model… ☆15 · Dec 11, 2023 · Updated 2 years ago
- PyTorch implementation of the Gato paper from DeepMind ☆12 · Feb 8, 2023 · Updated 3 years ago
- ☆54 · Jan 17, 2025 · Updated last year
- This repo contains the official implementation of the ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long … ☆95 · May 17, 2024 · Updated last year
- Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition" ☆233 · Jun 1, 2025 · Updated 8 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆201 · Feb 5, 2024 · Updated 2 years ago
- Official implementation for the CVPR 2024 paper CAMEL ☆20 · Jun 20, 2024 · Updated last year
- Generate high-quality textual or multi-modal datasets with Agents ☆18 · Jun 7, 2023 · Updated 2 years ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion ☆60 · Nov 30, 2025 · Updated 2 months ago
- ☆19 · Nov 25, 2024 · Updated last year
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding" ☆27 · Jan 17, 2026 · Updated 3 weeks ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆53 · Oct 22, 2023 · Updated 2 years ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult… ☆11 · Aug 29, 2023 · Updated 2 years ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models ☆56 · Jul 9, 2024 · Updated last year
- ☆23 · Jul 8, 2023 · Updated 2 years ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆62 · Apr 30, 2024 · Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆98 · May 3, 2024 · Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 · Nov 11, 2024 · Updated last year
- [NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models ☆123 · Jul 1, 2024 · Updated last year
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes ☆66 · May 5, 2024 · Updated last year
- My implementation of "Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML ☆26 · Jan 16, 2024 · Updated 2 years ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference ☆180 · Oct 10, 2024 · Updated last year
- PyTorch Implementation for InMaP ☆11 · Oct 28, 2023 · Updated 2 years ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations… ☆12 · Mar 11, 2024 · Updated last year
- ☆11 · Sep 1, 2024 · Updated last year
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models" ☆14 · Nov 11, 2024 · Updated last year
- A suite of finetuned LLMs for atomically precise function calling 🧪 ☆17 · Feb 6, 2026 · Updated last week
- [ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification ☆24 · Oct 17, 2024 · Updated last year
- Encourage medical LLMs to engage in deep thinking similar to DeepSeek-R1. ☆26 · Apr 24, 2025 · Updated 9 months ago
- ☆33 · Aug 9, 2024 · Updated last year
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?". ☆32 · May 12, 2025 · Updated 9 months ago
- [Paper][AAAI2024] Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations ☆153 · Jun 22, 2024 · Updated last year
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119 ☆1,214 · Sep 2, 2023 · Updated 2 years ago
- ☆13 · Sep 16, 2022 · Updated 3 years ago
- A recurrent neural network (RNN) that generates drug-like molecules for drug discovery. ☆11 · May 4, 2022 · Updated 3 years ago