ZihaoHuang-notabot / Ultra-Sparse-Memory-NetworkLinks
☆26Updated last week
Alternatives and similar repositories for Ultra-Sparse-Memory-Network
Users that are interested in Ultra-Sparse-Memory-Network are comparing it to the libraries listed below
Sorting:
- The official repo of continuous speculative decoding☆29Updated 6 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆13Updated 6 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆48Updated 2 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆43Updated 2 months ago
- ☆21Updated 4 months ago
- Official implementation of ECCV24 paper: POA☆24Updated last year
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆19Updated 7 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆16Updated 6 months ago
- [NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting☆52Updated 3 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆95Updated last month
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆26Updated 2 months ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- Memory Efficient Training Framework for Large Video Generation Model☆25Updated last year
- ☆39Updated 4 months ago
- ☆34Updated 4 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Updated 9 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆39Updated 7 months ago
- Geometric-Mean Policy Optimization☆80Updated last month
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- ☆43Updated 10 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 4 months ago
- ☆24Updated last month
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆22Updated 6 months ago
- 😊 TPTT: Transforming Pretrained Transformers into Titans☆27Updated last week
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆56Updated last year
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆113Updated 3 months ago
- ☆56Updated 2 months ago
- Resa: Transparent Reasoning Models via SAEs☆41Updated this week
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆39Updated 6 months ago