ZihaoHuang-notabot / Ultra-Sparse-Memory-NetworkLinks
☆37Updated 3 months ago
Alternatives and similar repositories for Ultra-Sparse-Memory-Network
Users that are interested in Ultra-Sparse-Memory-Network are comparing it to the libraries listed below
Sorting:
- The official repo of continuous speculative decoding☆31Updated 9 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆17Updated 10 months ago
- Learning to Skip the Middle Layers of Transformers☆16Updated 5 months ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆20Updated 10 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Updated 5 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆51Updated 5 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Updated last month
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Updated last year
- Official implementation of ECCV24 paper: POA☆24Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆27Updated 5 months ago
- MobileLLM-R1☆72Updated 3 months ago
- Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆49Updated 2 weeks ago
- ☆46Updated last year
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆118Updated 2 months ago
- ☆191Updated 3 weeks ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 9 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Updated 6 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆37Updated 2 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆41Updated last week
- Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks☆33Updated last month
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 8 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆76Updated last month
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆40Updated 10 months ago
- Geometric-Mean Policy Optimization☆96Updated last month
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆69Updated 2 weeks ago
- Resa: Transparent Reasoning Models via SAEs☆47Updated 3 months ago
- ☆23Updated 7 months ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Updated 5 months ago
- ☆72Updated 6 months ago