ZihaoHuang-notabot / Ultra-Sparse-Memory-NetworkLinks
☆30Updated last month
Alternatives and similar repositories for Ultra-Sparse-Memory-Network
Users that are interested in Ultra-Sparse-Memory-Network are comparing it to the libraries listed below
Sorting:
- The official repo of continuous speculative decoding☆30Updated 7 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Updated 7 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆16Updated 7 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆51Updated 3 months ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆20Updated 8 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Updated 10 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆45Updated 3 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆27Updated last year
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Updated 3 months ago
- MobileLLM-R1☆55Updated last month
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆22Updated 7 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- Resa: Transparent Reasoning Models via SAEs☆44Updated last month
- Official implementation of ECCV24 paper: POA☆24Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆54Updated last week
- ☆44Updated last year
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆39Updated 8 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆26Updated 3 months ago
- ☆21Updated 5 months ago
- ☆34Updated 5 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆33Updated 2 weeks ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 6 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated last year
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆23Updated 3 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 7 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆81Updated last year
- 😊 TPTT: Transforming Pretrained Transformers into Titans☆29Updated 3 weeks ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆103Updated last week