FairyFali / SLMs-Survey
Survey of Small Language Models from Penn State, ...
★171 · Updated 2 months ago
Alternatives and similar repositories for SLMs-Survey:
Users who are interested in SLMs-Survey are comparing it to the repositories listed below.
- A Survey on Efficient Reasoning for LLMs ★301 · Updated last week
- A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond ★160 · Updated this week
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ★170 · Updated this week
- Latest Advances on Long Chain-of-Thought Reasoning ★174 · Updated this week
- ★91 · Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ★180 · Updated 3 weeks ago
- Code implementation of synthetic continued pretraining ★99 · Updated 3 months ago
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc. ★115 · Updated 6 months ago
- A Comprehensive Survey on Long Context Language Modeling ★129 · Updated 2 weeks ago
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc. ★191 · Updated this week
- ★307 · Updated last week
- ★101 · Updated 4 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" ★234 · Updated 2 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ★135 · Updated 9 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ★181 · Updated last year
- ★184 · Updated last month
- ★94 · Updated 3 weeks ago
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model… ★118 · Updated 2 months ago
- ★92 · Updated last month
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ★147 · Updated 7 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ★128 · Updated 3 months ago
- ★151 · Updated last week
- ★278 · Updated 3 weeks ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs ★120 · Updated last month
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models ★79 · Updated last week
- [NeurIPS 2024] Agent Planning with World Knowledge Model ★124 · Updated 3 months ago
- ★265 · Updated 8 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ★112 · Updated 3 weeks ago
- a curated list of the role of small models in the LLM era ★98 · Updated 6 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey` ★170 · Updated 4 months ago