[Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models
☆43Mar 11, 2025Updated 11 months ago
Alternatives and similar repositories for GMem
Users that are interested in GMem are comparing it to the libraries listed below
Sorting:
- ☆20May 28, 2025Updated 9 months ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- ☆52Jun 13, 2025Updated 8 months ago
- [Preprint] UCGM: Unified Continuous Generative Models☆181May 27, 2025Updated 9 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆171Feb 18, 2025Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆53Oct 12, 2025Updated 4 months ago
- ☆21Jan 17, 2025Updated last year
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆245Oct 12, 2025Updated 4 months ago
- ☆39Apr 27, 2024Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 3 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 6 months ago
- ☆13Nov 27, 2025Updated 3 months ago
- ☆11Oct 11, 2023Updated 2 years ago
- recipe for training fully-featured self supervised image jepa models☆12Jun 4, 2025Updated 8 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- RWKV-7 mini☆12Mar 29, 2025Updated 11 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- ☆11Jan 16, 2024Updated 2 years ago
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆15Feb 12, 2026Updated 2 weeks ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data.☆30Jan 21, 2026Updated last month
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆35Nov 24, 2025Updated 3 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆57Mar 10, 2025Updated 11 months ago
- ☆67Mar 21, 2025Updated 11 months ago
- Elucidating The Design Space of Classifier-Guided Diffusion Generation☆32Jan 20, 2024Updated 2 years ago
- Official implementation of Inductive Moment Matching☆574Jul 11, 2025Updated 7 months ago
- ☆22May 11, 2025Updated 9 months ago
- [NeurIPS 2025] ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models☆31Jul 1, 2025Updated 8 months ago
- ☆14Jan 10, 2024Updated 2 years ago
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆25Aug 5, 2025Updated 6 months ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆14Nov 14, 2024Updated last year
- MLLM @ Game☆16May 12, 2025Updated 9 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆174Jun 26, 2025Updated 8 months ago
- ☆110Feb 10, 2026Updated 2 weeks ago
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- [IROS 2025] CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting☆30Jul 25, 2025Updated 7 months ago
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆454Dec 6, 2025Updated 2 months ago
- Official code for the paper "Attention as a Hypernetwork"☆48Jun 22, 2024Updated last year
- ☆15Sep 18, 2023Updated 2 years ago