AutoGaze automatically removes redundant patches in a video, reducing #tokens in ViT/MLLM by 4x-100x.
☆282May 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for AutoGaze
Users that are interested in AutoGaze are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 11 months ago
- super-resolution; post-training quantization; model compression☆14Nov 10, 2023Updated 2 years ago
- Pytorch implementation of Google TCAV☆10Jan 11, 2019Updated 7 years ago
- PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation☆23Jun 20, 2025Updated 11 months ago
- Code & data for "Towards flexible perception with visual memory" (ICML 2025)☆19Sep 24, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- M-LSDを用い て四角形を検出し、射影変換を行うサンプルプログラム☆10Jun 2, 2021Updated 4 years ago
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆35Feb 26, 2025Updated last year
- This is a LoRA model finetuned on Wan-I2V-14B-480P. It turns things in the image into fluffy toys.☆19Nov 10, 2025Updated 6 months ago
- ☆54Nov 6, 2025Updated 6 months ago
- 拡散モデルを学びたい初学者向けです。書籍「コンピュータビジョン最前線 Summer 2023」の「イマドキノ拡散モデル」の解説をベースに、CIFER-10で画像生成をします☆19Jul 2, 2023Updated 2 years ago
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆48Jul 3, 2025Updated 10 months ago
- Accepted By The 39th Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track☆25Nov 17, 2025Updated 6 months ago
- ☆44Oct 23, 2025Updated 7 months ago
- Visual haptic using depth image☆19Dec 20, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2022] (Amortized) distributional control for pre-trained generative models☆121Sep 4, 2023Updated 2 years ago
- Streaming Video Instruction Tuning☆75Feb 25, 2026Updated 3 months ago
- [ICCV 2025] HUMOTO Dataset Code Release☆62Nov 6, 2025Updated 6 months ago
- real-to-sim evaluation suite for robot parkour☆11Jan 19, 2025Updated last year
- [ICML 2026] LatentMorph: Morphing Latent Reasoning into Image Generation☆44May 5, 2026Updated 3 weeks ago
- Self-supervised adversarial masking for point clouds☆11Jul 12, 2023Updated 2 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆16Apr 29, 2025Updated last year
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆74Jan 13, 2026Updated 4 months ago
- Keras implementation of EfficientNet model for age and gender estimation☆14Mar 18, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆57Mar 5, 2026Updated 2 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆40May 4, 2026Updated 3 weeks ago
- An CUDA-based library for computed tomography (CT) reconstruction with differentiable operators.☆20Updated this week
- Code for Paper "The Geometry of Reasoning: Flowing Logics in Representation Space" (ICLR 2026)☆48Jan 31, 2026Updated 3 months ago
- PyTorch implementation of the Mamba-3 architecture☆107Mar 18, 2026Updated 2 months ago
- ☆11Apr 5, 2023Updated 3 years ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens☆17Sep 8, 2025Updated 8 months ago
- ☆14Jul 8, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆28Jun 16, 2025Updated 11 months ago
- Siggraph 2025 Journal track☆26Aug 13, 2025Updated 9 months ago
- [ICCV 2023] Official repository for "Tree-Structured Shading Decomposition"☆45Jan 2, 2025Updated last year
- This is the official training code of OmniSVG☆41Jan 19, 2026Updated 4 months ago
- [ICCV 2023] PATMAT Person Aware Tuning of Mask Aware Transformer for Face Inpainting☆30Jan 5, 2024Updated 2 years ago
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos☆521Feb 11, 2026Updated 3 months ago