Fork of Flame repo for training of some new stuff in development
☆19 · Feb 27, 2026 · Updated last week
Alternatives and similar repositories for flame
Users who are interested in flame are comparing it to the repositories listed below
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax" · ☆88 · Sep 12, 2025 · Updated 5 months ago
- ☆21 · Jul 21, 2025 · Updated 7 months ago
- The Official PyTorch Implementation of "Brain-like Variational Inference" (NeurIPS 2025 Paper) · ☆68 · Feb 9, 2026 · Updated 3 weeks ago
- Universal Neurons in GPT2 Language Models · ☆30 · May 28, 2024 · Updated last year
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models · ☆35 · Jun 12, 2024 · Updated last year
- ☆19 · Jun 4, 2025 · Updated 9 months ago
- ☆16 · Dec 9, 2023 · Updated 2 years ago
- ☆15 · Oct 4, 2024 · Updated last year
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file · ☆16 · Jul 26, 2024 · Updated last year
- ☆19 · Aug 4, 2025 · Updated 7 months ago
- BESA is a differentiable weight pruning technique for large language models. · ☆17 · Mar 4, 2024 · Updated 2 years ago
- ☆16 · Oct 20, 2025 · Updated 4 months ago
- Transmute AI Lab Model Efficiency Toolkit · ☆19 · Oct 2, 2023 · Updated 2 years ago
- ☆11 · Feb 5, 2026 · Updated last month
- [ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition · ☆19 · Nov 25, 2024 · Updated last year
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models. · ☆19 · Dec 27, 2024 · Updated last year
- ☆23 · Jan 27, 2025 · Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". · ☆30 · Nov 12, 2024 · Updated last year
- Official Repository for Task-Circuit Quantization · ☆24 · Jun 1, 2025 · Updated 9 months ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image? · ☆42 · Jul 26, 2025 · Updated 7 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization" · ☆22 · Nov 9, 2025 · Updated 3 months ago
- [JAG'26] SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence · ☆59 · Jan 8, 2026 · Updated last month
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration · ☆20 · Jun 27, 2025 · Updated 8 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… · ☆29 · Jul 24, 2025 · Updated 7 months ago
- Repository for code used in the xVal paper · ☆148 · Apr 4, 2024 · Updated last year
- ☆39 · Oct 31, 2025 · Updated 4 months ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance… · ☆156 · Apr 7, 2025 · Updated 10 months ago
- Masked Structural Growth for 2x Faster Language Model Pre-training · ☆25 · Apr 28, 2024 · Updated last year
- ☆33 · Oct 4, 2024 · Updated last year
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models · ☆25 · Sep 27, 2023 · Updated 2 years ago
- Tools to simplify life with AI · ☆30 · Apr 4, 2025 · Updated 11 months ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners" · ☆30 · Apr 27, 2024 · Updated last year
- entropix style sampling + GUI · ☆27 · Oct 30, 2024 · Updated last year
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification. · ☆26 · Apr 15, 2025 · Updated 10 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models · ☆46 · Jul 24, 2025 · Updated 7 months ago
- Cascade Speculative Drafting · ☆33 · Apr 2, 2024 · Updated last year
- Plan✕ is a platform for creating and publishing digital planning services · ☆17 · Updated this week
- Superposition Yields Robust Neural Scaling · ☆58 · Feb 12, 2026 · Updated 3 weeks ago
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning · ☆33 · Jun 2, 2023 · Updated 2 years ago