rccchoudhury/apt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rccchoudhury/apt)

rccchoudhury / apt

Public release of the code for "Accelerating Vision Transformers with Adaptive Patches"

☆114

Alternatives and similar repositories for apt

Users that are interested in apt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

1069066484 / PanoSwinTransformerObjectDetection
View on GitHub
☆18Jun 9, 2023Updated 3 years ago
amazon-far / deltatok
View on GitHub
[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
☆208Updated this week
tiiuae / siglino
View on GitHub
AMoE: Agglomerative Mixture-of-Experts Vision Foundation Models
☆53Jun 11, 2026Updated last month
neuroailab / SpelkeNet
View on GitHub
☆15Jul 23, 2025Updated 11 months ago
davnords / MuM
View on GitHub
[CVPR26] MuM's a pretty good feature extractor for 3D tasks, probably the best.
☆120Jul 14, 2026Updated last week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Junggy / SCRREAM
View on GitHub
☆23Dec 30, 2024Updated last year
OpenGVLab / Mono-InternVL
View on GitHub
[CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
☆109Jul 18, 2025Updated last year
Keely-Ai / F2D2
View on GitHub
Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models
☆22Mar 5, 2026Updated 4 months ago
apple / ml-flextok
View on GitHub
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆321Jun 2, 2025Updated last year
wrchen530 / batrack
View on GitHub
[ICCV 2025 Oral] Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction (BA-Track)
☆101Nov 25, 2025Updated 7 months ago
mlpc-ucsd / OverLayBench
View on GitHub
(NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps
☆27May 4, 2026Updated 2 months ago
benbergner / cropr
View on GitHub
A token pruning method that accelerates ViTs for various tasks while maintaining high performance.
☆29Jul 21, 2025Updated last year
zcq15 / ACDNet
View on GitHub
☆24Jan 10, 2022Updated 4 years ago
Visual-AI / speed3r
View on GitHub
[CVPR 2026 Findings] Speed3R: Sparse Feed-forward 3D Reconstruction Models
☆75Apr 7, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
MoeinSorkhei / APLA
View on GitHub
Code for APLA: A Simple Adaptation Method for Vision Transformers
☆16Apr 3, 2025Updated last year
mvp18 / 3DConsistency-metrics
View on GitHub
Official Code for "Can These Views Be One Scene?"
☆16May 22, 2026Updated last month
VisualSphinx / VisualSphinx
View on GitHub
☆17Jun 3, 2025Updated last year
WenjieShu / LoopViT
View on GitHub
☆45Feb 4, 2026Updated 5 months ago
JooHyoSeok / ScaleMaster-Dataset
View on GitHub
ScaleMaster-Dataset
☆17May 11, 2026Updated 2 months ago
liruilong940607 / prope
View on GitHub
Cameras as Relative Positional Encoding
☆739Dec 18, 2025Updated 7 months ago
wangf3014 / VTok
View on GitHub
Official implementation of VTok: A Unified Video Tokenizer with Decoupled Spatial-Temporal Latents
☆15Feb 5, 2026Updated 5 months ago
NVlabs / RADIO
View on GitHub
Official repository for "AM-RADIO: Reduce All Domains Into One"
☆1,897May 29, 2026Updated last month
pollen-robotics / ReachyTeleoperation
View on GitHub
Unity app for teleoperating Reachy
☆19Sep 12, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lvsn / UniLight
View on GitHub
The official repo of the CVPR 2026 paper UniLight
☆17May 7, 2026Updated 2 months ago
facebookresearch / EUPE
View on GitHub
Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized…
☆685Apr 14, 2026Updated 3 months ago
SihanXU / nepa
View on GitHub
PyTorch implementation of NEPA
☆338Feb 9, 2026Updated 5 months ago
PKU-YuanGroup / UniSandBox
View on GitHub
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward
☆60Nov 27, 2025Updated 7 months ago
facebookresearch / webssl
View on GitHub
Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).
☆214Mar 20, 2026Updated 4 months ago
hanlinm2 / projective-geometry
View on GitHub
[CVPR 2024] Shadows Don’t Lie and Lines Can’t Bend! Generative Models don’t know Projective Geometry...for now
☆49Jun 19, 2024Updated 2 years ago
wzhan24 / UniMate
View on GitHub
☆11Jun 22, 2025Updated last year
NVlabs / finite-difference-flow-optimization
View on GitHub
FDFO: Finite Difference Flow Optimization
☆115Apr 27, 2026Updated 2 months ago
ByteDance-Seed / SAIL
View on GitHub
Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"
☆85Oct 29, 2025Updated 8 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
JoseponLee / IntentQA
View on GitHub
Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.
☆26Nov 29, 2024Updated last year
wrchen530 / nova3r
View on GitHub
[ICLR 2026] NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction
☆152Jun 13, 2026Updated last month
nopQAQ / Test3R
View on GitHub
☆129Jun 17, 2025Updated last year
cgjiahui / WallPlan
View on GitHub
☆23Sep 29, 2022Updated 3 years ago
NVlabs / AutoGaze
View on GitHub
AutoGaze automatically removes redundant patches in a video, reducing #tokens in ViT/MLLM by 4x-100x.
☆297May 5, 2026Updated 2 months ago
a1600012888 / LaCT
View on GitHub
Code release for paper "Test-Time Training Done Right"
☆494Jan 5, 2026Updated 6 months ago
ngbrjyj / A2LC
View on GitHub
[AAAI 2026] A²LC: Active and Automated Label Correction for Semantic Segmentation
☆16Jun 25, 2026Updated 3 weeks ago