FairyFali / SLMs-Survey
Survey of Small Language Models from Penn State, ...
☆214 Updated 2 weeks ago
Alternatives and similar repositories for SLMs-Survey
Users who are interested in SLMs-Survey are comparing it to the repositories listed below
- ☆154 Updated last month
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models ☆119 Updated last month
- A Comprehensive Survey on Long Context Language Modeling ☆203 Updated 4 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆161 Updated last year
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc. ☆210 Updated 3 weeks ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆263 Updated 4 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆131 Updated 9 months ago
- A collection of 150+ surveys on LLMs ☆341 Updated 9 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆222 Updated 3 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA ☆141 Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ☆191 Updated last year
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆189 Updated 4 months ago
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model… ☆154 Updated 5 months ago
- ☆104 Updated 11 months ago
- ☆37 Updated 10 months ago
- a curated list of the role of small models in the LLM era ☆109 Updated last year
- ☆165 Updated last month
- Code implementation of synthetic continued pretraining ☆138 Updated 10 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ☆139 Updated 6 months ago
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆82 Updated last year
- ☆100 Updated last year
- The code and data of DPA-RAG, accepted by WWW 2025 main conference. ☆63 Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆257 Updated 6 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs. ☆44 Updated last year
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (… ☆404 Updated last week
- ☆462 Updated 3 months ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models ☆64 Updated 8 months ago
- The All-in-one Judge Models introduced by Opencompass ☆114 Updated 4 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble" ☆161 Updated 2 weeks ago
- ☆136 Updated 2 months ago