ZihaoHuang-notabot / Ultra-Sparse-Memory-NetworkLinks
☆34Updated 3 months ago
Alternatives and similar repositories for Ultra-Sparse-Memory-Network
Users that are interested in Ultra-Sparse-Memory-Network are comparing it to the libraries listed below
Sorting:
- The official repo of continuous speculative decoding☆31Updated 8 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Updated last month
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆52Updated 4 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Updated 5 months ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆20Updated 9 months ago
- ☆23Updated 6 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆27Updated 4 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆40Updated 9 months ago
- ☆34Updated 7 months ago
- Official implementation of ECCV24 paper: POA☆24Updated last year
- ☆19Updated 11 months ago
- ☆39Updated 7 months ago
- ☆126Updated this week
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆45Updated 3 weeks ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆40Updated 10 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆17Updated 9 months ago
- ☆46Updated last year
- Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks☆33Updated 3 weeks ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆17Updated 9 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆70Updated last month
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Updated 5 months ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆134Updated last month
- ☆72Updated 5 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆115Updated last month
- [NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting☆67Updated 6 months ago
- ☆21Updated 3 months ago
- Geometric-Mean Policy Optimization☆95Updated last month
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆55Updated last year
- Resa: Transparent Reasoning Models via SAEs☆46Updated 2 months ago