FairyFali / SLMs-SurveyLinks
Survey of Small Language Models from Penn State, ...
☆190Updated this week
Alternatives and similar repositories for SLMs-Survey
Users that are interested in SLMs-Survey are comparing it to the libraries listed below
Sorting:
- ☆132Updated 5 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆188Updated last year
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆207Updated this week
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆150Updated last year
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆209Updated last month
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆114Updated 2 months ago
- A Comprehensive Survey on Long Context Language Modeling☆180Updated last month
- A collection of 150+ surveys on LLMs☆325Updated 6 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆258Updated last month
- ☆405Updated last month
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆118Updated 6 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆107Updated 10 months ago
- a curated list of the role of small models in the LLM era☆104Updated 11 months ago
- Code implementation of synthetic continued pretraining☆125Updated 7 months ago
- ☆103Updated 8 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆99Updated 3 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆139Updated 9 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆61Updated 7 months ago
- ☆147Updated 3 months ago
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model…☆143Updated 3 months ago
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆152Updated this week
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆83Updated 9 months ago
- ☆120Updated 5 months ago
- ☆127Updated last week
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆360Updated 11 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆248Updated 3 months ago
- ☆67Updated 2 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆171Updated last month
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆174Updated 2 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆182Updated 2 weeks ago