Official implement of paper "Revisiting Multimodal Positional Encoding in VisionโLanguage Models", ICLR 2026
โ87May 4, 2026Updated last month
Alternatives and similar repositories for Multimodal-RoPEs
Users that are interested in Multimodal-RoPEs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Block-Recurrent Dynamics in ViTs ๐ฆโ43May 21, 2026Updated 2 weeks ago
- โ85Feb 5, 2026Updated 4 months ago
- [SIGGRAPH 2026] OmniRoam: World Wandering via Long-Horizon Panoramic Video Generationโ99Apr 8, 2026Updated 2 months ago
- SKT A.X LLM 3.1โ13Jul 24, 2025Updated 10 months ago
- [CVPR 2022 Oral] Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representationsโ13Jul 14, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- collection with description of super-resolution related papers, repositories, datasets, loss functions and etc.โ11Dec 12, 2023Updated 2 years ago
- SKT A.X LLM K1โ30Feb 11, 2026Updated 4 months ago
- โ15Jan 12, 2024Updated 2 years ago
- Learning Debiased and Disentangled Representations for Semantic Segmentation (NeurIPS 2021)โ13Jan 23, 2022Updated 4 years ago
- โ39Jun 2, 2026Updated last week
- โ10Aug 29, 2024Updated last year
- [MedIA 2025] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentationโ41Aug 10, 2025Updated 10 months ago
- โ22Sep 26, 2024Updated last year
- [ICML 2026] d3LLM: Ultra-Fast Diffusion LLM ๐โ138May 1, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI โข AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)โ101May 13, 2026Updated 3 weeks ago
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimizationโ68Sep 19, 2025Updated 8 months ago
- โ53Aug 22, 2025Updated 9 months ago
- [ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMsโ85Jan 17, 2026Updated 4 months ago
- [ICCV 2025] Official PyTorch Code for "Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval"โ18Aug 23, 2025Updated 9 months ago
- โ32Mar 4, 2025Updated last year
- https://www.kaggle.com/c/nbme-score-clinical-patient-notesโ10Sep 1, 2022Updated 3 years ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Visionโ228May 31, 2026Updated last week
- ๐ ๆๆๆๆไฝ ๅจ่ฎบๆไธญๆๅ ฅไปฃ็ ้พๆฅโ25Aug 2, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient โข AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.โ29Oct 19, 2025Updated 7 months ago
- โ35Jun 18, 2024Updated last year
- โ17Apr 21, 2026Updated last month
- โ43Jan 16, 2026Updated 4 months ago
- This is the repo for DenseAttention and DANet - fast and conceptually simple modification of standard attention and Transformerโ20Apr 6, 2026Updated 2 months ago
- SR-DiT Speedrunning ImageNet Diffusionโ138Apr 6, 2026Updated 2 months ago
- CVPR2026โ32Sep 18, 2025Updated 8 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decodingโ37Apr 25, 2026Updated last month
- A Benchmark for Cinematographic Technique Understanding and Generationโ27Sep 19, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer โข AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Easy and Efficient dLLM Fine-Tuningโ257Mar 2, 2026Updated 3 months ago
- ๐The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"โ25Dec 2, 2025Updated 6 months ago
- Official Implementation for Generative Neural Fields by Mixtures of Neural Implicit Functionsโ19Mar 10, 2024Updated 2 years ago
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalitiesโ42Apr 28, 2026Updated last month
- Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".โ30Jul 22, 2025Updated 10 months ago
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"โ63Mar 23, 2026Updated 2 months ago
- โ11Aug 31, 2024Updated last year