dsb-ifi / SPiTView external linksLinks
A Spitting Image: Modular Superpixel Tokenization in Vision Transformers
☆21Sep 12, 2025Updated 5 months ago
Alternatives and similar repositories for SPiT
Users that are interested in SPiT are comparing it to the libraries listed below
Sorting:
- Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens☆44Mar 24, 2025Updated 10 months ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆62Aug 6, 2025Updated 6 months ago
- ☆21Sep 16, 2024Updated last year
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆23Mar 18, 2025Updated 10 months ago
- Joint Modelling Histology and Molecular Markers for Glioma Classification☆12Jun 4, 2025Updated 8 months ago
- [IEEE TIP 2024] Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model☆34Apr 24, 2024Updated last year
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- ☆30Jul 14, 2022Updated 3 years ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated 11 months ago
- Towards training VQ-VAE models robustly!☆91Jul 14, 2025Updated 6 months ago
- PyTorch Implementation of the CVPR'24 Paper "Learned Lossless Image Compression based on Bit Plane Slicing"☆37Mar 6, 2025Updated 11 months ago
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆39Mar 11, 2025Updated 11 months ago
- Code for Breaking the Frame: Visual Place Recognition by Overlap Prediction☆39Jan 4, 2025Updated last year
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆46Aug 12, 2025Updated 6 months ago
- ☆10Apr 24, 2024Updated last year
- Multi-level Attention Network for Retinal Vessel Segmentation☆11May 10, 2021Updated 4 years ago
- ☆10Oct 5, 2022Updated 3 years ago
- Turbo coder and decoder☆12Oct 11, 2023Updated 2 years ago
- The code for Spectral Super-Resolution via Deep Low-Rank Tensor Representation☆11Mar 21, 2024Updated last year
- Alpha64 R10000 Two-Way Superscalar Processor☆11May 6, 2019Updated 6 years ago
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.☆48Oct 16, 2024Updated last year
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆50Mar 11, 2025Updated 11 months ago
- Jtag parsing scripts☆10Oct 14, 2023Updated 2 years ago
- ☆20Oct 15, 2025Updated 3 months ago
- The official pytorch implementation of the paper PromptHSI: Universal Hyperspectral Image Restoration Framework for Composite Degradation…☆15Nov 15, 2025Updated 2 months ago
- A general slow DDR3 interface. Very little resource consumption. Suits for all FPGAs with 1.5V IO voltage.☆11Dec 14, 2022Updated 3 years ago
- Inference demo for the MICCAI-2020 paper "Self-supervision on Unlabelled OR Data for Multi-person 2D/3D Human Pose Estimation"☆11May 23, 2025Updated 8 months ago
- Energy-based Dropout and Pruning of Deep Neural Networks☆10Oct 9, 2020Updated 5 years ago
- a Video Quality Analysis Toolkit☆13May 16, 2025Updated 8 months ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- ☆12Dec 19, 2024Updated last year
- ☆12May 20, 2025Updated 8 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Jan 30, 2026Updated 2 weeks ago
- "Causality: Models, Reasoning, and Inference-Judea Pearl(2009)"中文翻译及学习笔记☆15Feb 18, 2022Updated 3 years ago
- Conditional Latent Coding (CLC) for Deep Image Compression☆15Feb 6, 2026Updated last week
- ☆14Dec 20, 2022Updated 3 years ago
- Implementation of the MMDAgent for use as a live receptionist in Carnegie Mellon's School of Computer Science.☆15Apr 11, 2013Updated 12 years ago
- ☆11Jan 11, 2025Updated last year
- KiCad RF Stuff☆14Aug 17, 2021Updated 4 years ago