Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"
☆133Feb 4, 2026Updated 2 months ago
Alternatives and similar repositories for capi
Users that are interested in capi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2025] Official Implementation for SimDINO/SimDINOv2☆201Mar 15, 2025Updated last year
- ☆33Nov 4, 2024Updated last year
- ☆44Jan 14, 2026Updated 2 months ago
- Official implementation for SSDD Single-Step Diffusion Decoder for Efficient Image Tokenization.☆58Mar 16, 2026Updated 3 weeks ago
- Fast Vision Mamba : Pool your Spatial Dimensions for Accelerated Processing☆18Jan 28, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning☆268Sep 24, 2025Updated 6 months ago
- ☆40Oct 31, 2025Updated 5 months ago
- ☆25May 23, 2025Updated 10 months ago
- BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c…☆40Apr 15, 2025Updated 11 months ago
- [NeurIPS25 D&B Spotlight] A tile-level histopathology image understanding benchmark☆45Apr 3, 2026Updated last week
- ☆32Jun 6, 2024Updated last year
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆308Jun 2, 2025Updated 10 months ago
- This is a repository that implements the Dense NN Retrieval Evaluation used for evaluating the In-Context Learning Capabilities of Vision…☆30Nov 3, 2025Updated 5 months ago
- research impl of Native Sparse Attention (2502.11089)☆63Feb 19, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆23Dec 4, 2024Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- ☆22Jul 3, 2025Updated 9 months ago
- Learning to Count without Annotations☆23May 24, 2024Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆92Oct 30, 2024Updated last year
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Mar 7, 2025Updated last year
- ☆49Feb 23, 2025Updated last year
- Implementation of the proposed MaskBit from Bytedance AI☆82Nov 12, 2024Updated last year
- Framework to reduce autotune overhead to zero for well known deployments.☆98Sep 19, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆25Mar 26, 2026Updated 2 weeks ago
- Minimal Implimentation of VCRec (2024) for collapse provention.☆18Jan 28, 2025Updated last year
- ☆21Mar 3, 2025Updated last year
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆95Apr 29, 2024Updated last year
- ☆28Oct 7, 2025Updated 6 months ago
- ☆47Jan 31, 2026Updated 2 months ago
- An up-to-date & curated list of awesome semi-supervised segmentation papers, methods & resources.☆13Dec 22, 2023Updated 2 years ago
- Experiment of using Tangent to autodiff triton☆82Jan 22, 2024Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆56Jan 31, 2025Updated last year
- ☆30Dec 2, 2024Updated last year
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆129Oct 18, 2024Updated last year
- Latex template for Oxford integrated thesis☆19Apr 7, 2025Updated last year
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- [IEEE TPAMI] Anomaly Detection in Chest X-ray☆25Jul 30, 2024Updated last year