tigerchen52 / awesome_role_of_small_models
a curated list of the role of small models in the LLM era
☆95Updated 6 months ago
Alternatives and similar repositories for awesome_role_of_small_models:
Users that are interested in awesome_role_of_small_models are comparing it to the libraries listed below
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆131Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 6 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆110Updated 10 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆78Updated last month
- This is the official repository for Inheritune.☆109Updated last month
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆70Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM☆67Updated 6 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆45Updated last month
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆53Updated 5 months ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆52Updated 5 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆30Updated 9 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated 3 weeks ago
- ☆34Updated 5 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆229Updated last month
- Survey of Small Language Models from Penn State, ...☆169Updated 2 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆179Updated 11 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆119Updated 7 months ago
- ☆40Updated last month
- ☆142Updated 11 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆62Updated 9 months ago
- ☆82Updated 4 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆37Updated last month
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆45Updated 4 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆84Updated last year
- Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.☆22Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 6 months ago
- A pipeline for LLM knowledge distillation☆98Updated last month