FairyFali / SLMs-Survey
Survey of Small Language Models from Penn State, ...
☆180 Updated 2 weeks ago
Alternatives and similar repositories for SLMs-Survey
Users who are interested in SLMs-Survey are comparing it to the repositories listed below
- ☆111 Updated 2 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model ☆139 Updated 5 months ago
- ☆349 Updated this week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆213 Updated 3 weeks ago
- a curated list of the role of small models in the LLM era ☆100 Updated 8 months ago
- A Comprehensive Survey on Long Context Language Modeling ☆147 Updated 2 weeks ago
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc. ☆198 Updated last week
- ☆94 Updated 5 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆248 Updated 3 weeks ago
- ☆105 Updated 2 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ☆183 Updated last year
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆141 Updated 5 months ago
- The official code for the paper “Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning” ☆117 Updated this week
- ☆140 Updated 4 months ago
- ☆193 Updated last week
- ☆60 Updated 2 weeks ago
- LLM hallucination paper list ☆316 Updated last year
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆186 Updated this week
- ☆210 Updated last week
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey` ☆178 Updated 6 months ago
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs. ☆42 Updated 11 months ago
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆141 Updated 11 months ago
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models ☆414 Updated 2 weeks ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ☆261 Updated last year
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆207 Updated last month
- This is the repository for the Tool Learning survey. ☆387 Updated 2 weeks ago
- ☆102 Updated 6 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆157 Updated 9 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models ☆106 Updated last month
- ☆95 Updated 8 months ago