FairyFali / SLMs-SurveyLinks
Survey of Small Language Models from Penn State, ...
☆197Updated 3 weeks ago
Alternatives and similar repositories for SLMs-Survey
Users that are interested in SLMs-Survey are comparing it to the libraries listed below
Sorting:
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆115Updated 3 months ago
- ☆138Updated 6 months ago
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆209Updated last week
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆212Updated last month
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆152Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆190Updated last year
- A collection of 150+ surveys on LLMs☆328Updated 7 months ago
- ☆432Updated last month
- a curated list of the role of small models in the LLM era☆105Updated 11 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆186Updated last month
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 2 months ago
- ☆122Updated 6 months ago
- A Comprehensive Survey on Long Context Language Modeling☆187Updated 2 months ago
- Code implementation of synthetic continued pretraining☆129Updated 8 months ago
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆83Updated 9 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆122Updated 7 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆253Updated 4 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆362Updated last year
- A curated list of Large Language Model with RAG☆81Updated last year
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆112Updated 4 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆95Updated 8 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆62Updated 8 months ago
- ☆129Updated last year
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆341Updated this week
- ☆103Updated 9 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆139Updated 10 months ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆68Updated 2 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆119Updated this week
- ☆37Updated 8 months ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆83Updated 11 months ago