sail-sg / LightTransLinks
The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"
☆20Updated 2 months ago
Alternatives and similar repositories for LightTrans
Users that are interested in LightTrans are comparing it to the libraries listed below
Sorting:
- ☆15Updated 3 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆42Updated this week
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆38Updated 4 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆91Updated 2 months ago
- Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?☆112Updated 8 months ago
- ☆51Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆77Updated 5 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆52Updated 5 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆99Updated last week
- ☆22Updated last year
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆27Updated 4 months ago
- ☆18Updated 4 months ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆46Updated 9 months ago
- ☆18Updated 6 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆26Updated 7 months ago
- Code and Model for NeurIPS 2024 Spotlight Paper "Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training…☆42Updated 9 months ago
- ☆91Updated 2 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆52Updated 4 months ago
- ☆75Updated last week
- PyTorch implementation of StableMask (ICML'24)☆13Updated last year
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆60Updated this week
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 4 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆47Updated 2 months ago
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆87Updated last month
- ☆110Updated last month
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆85Updated 3 weeks ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆53Updated last year
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆38Updated last year
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆127Updated this week
- ☆125Updated last month