UbiquitousLearning / SLM_SurveyLinks
☆95Updated 8 months ago
Alternatives and similar repositories for SLM_Survey
Users that are interested in SLM_Survey are comparing it to the libraries listed below
Sorting:
- FuseAI Project☆87Updated 5 months ago
- ☆34Updated last month
- ☆37Updated 8 months ago
- ☆126Updated last year
- Simple extension on vLLM to help you speed up reasoning model without training.☆161Updated 3 weeks ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆121Updated 5 months ago
- ☆117Updated 3 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆158Updated 2 months ago
- This is the official repository for Inheritune.☆111Updated 4 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆172Updated this week
- ☆80Updated 5 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆37Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆144Updated 9 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆115Updated last year
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆74Updated 8 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆117Updated last year
- a curated list of the role of small models in the LLM era☆101Updated 9 months ago
- ☆73Updated 2 months ago
- Complex Function Calling Benchmark.☆114Updated 5 months ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆74Updated last year
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆116Updated 6 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 9 months ago
- Survey of Small Language Models from Penn State, ...☆183Updated last month
- Code for KaLM-Embedding models☆78Updated 3 months ago
- ☆198Updated 6 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆47Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 9 months ago
- ☆64Updated last year
- Verifiers for LLM Reinforcement Learning☆60Updated 2 months ago