tigerchen52 / awesome_role_of_small_modelsLinks
a curated list of the role of small models in the LLM era
☆101Updated 9 months ago
Alternatives and similar repositories for awesome_role_of_small_models
Users that are interested in awesome_role_of_small_models are comparing it to the libraries listed below
Sorting:
- ☆45Updated last month
- Survey of Small Language Models from Penn State, ...☆183Updated last month
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆74Updated 8 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆184Updated last year
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.☆133Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆137Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 9 months ago
- ☆86Updated last month
- This is the official repository for Inheritune.☆111Updated 4 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆66Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆251Updated 2 weeks ago
- Code implementation of synthetic continued pretraining☆114Updated 5 months ago
- Large language models for document ranking.☆58Updated last month
- [ACL'25] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆64Updated this week
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆37Updated 3 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Updated 8 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆78Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆115Updated last year
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆108Updated 2 weeks ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆89Updated 6 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆144Updated 7 months ago
- ☆48Updated 3 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆106Updated 8 months ago
- ☆117Updated 3 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆38Updated 3 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆105Updated 5 months ago
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection☆87Updated last year
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆107Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆170Updated 2 weeks ago