AutoGaze automatically removes redundant patches in a video, reducing #tokens in ViT/MLLM by 4x-100x.
☆243Mar 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for AutoGaze
Users that are interested in AutoGaze are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 9 months ago
- super-resolution; post-training quantization; model compression☆14Nov 10, 2023Updated 2 years ago
- PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation☆22Jun 20, 2025Updated 9 months ago
- Code & data for "Towards flexible perception with visual memory" (ICML 2025)☆18Sep 24, 2024Updated last year
- ☆14Dec 11, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆35Feb 26, 2025Updated last year
- Evaluating Multiview Object Correspondence between Humans and Image models☆20Feb 12, 2025Updated last year
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding☆40Mar 16, 2025Updated last year
- ☆13Nov 29, 2024Updated last year
- [COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; 知乎:https://zhuanlan.zhihu.c…☆30Mar 5, 2025Updated last year
- 拡散モデルを学びたい初学者向けです。書籍「コンピュータビジョン最前線 Summer 2023」の「イマドキノ拡散モデル」の解説をベースに、CIFER-10で画像生成をします☆19Jul 2, 2023Updated 2 years ago
- ☆20Jul 13, 2022Updated 3 years ago
- [CVPR 2024] Generative Unlearning for Any Identity☆35Feb 19, 2025Updated last year
- Visual haptic using depth image☆19Dec 20, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [NeurIPS 2022] (Amortized) distributional control for pre-trained generative models☆121Sep 4, 2023Updated 2 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆16Apr 29, 2025Updated 11 months ago
- ☆57Mar 5, 2026Updated last month
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated 11 months ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"☆16May 31, 2024Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆39Jun 20, 2025Updated 9 months ago
- An CUDA-based library for computed tomography (CT) reconstruction with differentiable operators.☆18Mar 25, 2026Updated 3 weeks ago
- ☆20Mar 16, 2020Updated 6 years ago
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Franka simulator in Drake compatible with existing libfranka programs☆24Aug 29, 2025Updated 7 months ago
- ☆11Apr 5, 2023Updated 3 years ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens☆17Sep 8, 2025Updated 7 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆122Feb 22, 2026Updated last month
- ☆14Jul 8, 2023Updated 2 years ago
- Python implementation of HlibertSort for sorting 3D point clouds using space-filling curves☆11Apr 17, 2019Updated 6 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆27Jun 16, 2025Updated 10 months ago
- Siggraph 2025 Journal track☆26Aug 13, 2025Updated 8 months ago
- [ICCV 2023] Official repository for "Tree-Structured Shading Decomposition"☆45Jan 2, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Aug 14, 2025Updated 8 months ago
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos☆492Feb 11, 2026Updated 2 months ago
- [NeurIPS 2025] SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation and Understanding☆43Nov 30, 2025Updated 4 months ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆11Jul 9, 2025Updated 9 months ago
- [CVPR 2025] QuartDepth☆17Mar 24, 2025Updated last year
- Official implementation of "LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models".☆38Feb 1, 2026Updated 2 months ago
- FlexiFilm: Long Video Generation with Flexible Conditions☆31May 1, 2024Updated last year