UbiquitousLearning / SLM_SurveyLinks
☆95Updated 8 months ago
Alternatives and similar repositories for SLM_Survey
Users that are interested in SLM_Survey are comparing it to the libraries listed below
Sorting:
- Simple extension on vLLM to help you speed up reasoning model without training.☆152Updated this week
- ☆89Updated last week
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆156Updated last month
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆36Updated 3 months ago
- a curated list of the role of small models in the LLM era☆100Updated 8 months ago
- ☆37Updated 7 months ago
- ☆45Updated 3 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆47Updated last month
- ☆79Updated 4 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆116Updated 11 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆59Updated last month
- Survey of Small Language Models from Penn State, ...☆180Updated 2 weeks ago
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆116Updated 6 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆161Updated 11 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆143Updated 8 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆106Updated 3 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆107Updated 2 weeks ago
- ☆83Updated 2 weeks ago
- FuseAI Project☆87Updated 4 months ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆39Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆114Updated last year
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆126Updated last week
- ☆125Updated last year
- minimal GRPO implementation from scratch☆90Updated 2 months ago
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆60Updated last year
- ☆105Updated 2 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- ☆72Updated last month
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆59Updated 7 months ago