FairyFali / SLMs-SurveyLinks
Survey of Small Language Models from Penn State, ...
☆204Updated last week
Alternatives and similar repositories for SLMs-Survey
Users that are interested in SLMs-Survey are comparing it to the libraries listed below
Sorting:
- ☆143Updated 6 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆153Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆190Updated last year
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆124Updated 8 months ago
- ☆436Updated 2 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆117Updated 4 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆188Updated 2 months ago
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆208Updated 3 weeks ago
- A Comprehensive Survey on Long Context Language Modeling☆191Updated 3 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆137Updated this week
- A collection of 150+ surveys on LLMs☆334Updated 7 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆361Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 3 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆217Updated 2 months ago
- a curated list of the role of small models in the LLM era☆105Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆182Updated last year
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆138Updated 11 months ago
- A Survey on Data Selection for Language Models☆250Updated 5 months ago
- ☆104Updated 10 months ago
- ☆239Updated last year
- ☆67Updated 3 months ago
- A curated list of Large Language Model with RAG☆81Updated last year
- ☆129Updated 7 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆108Updated 11 months ago
- ☆133Updated 3 weeks ago
- Code implementation of synthetic continued pretraining☆133Updated 9 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆256Updated 4 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆63Updated 8 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆115Updated 4 months ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆64Updated 7 months ago