tigerchen52 / awesome_role_of_small_modelsLinks
a curated list of the role of small models in the LLM era
☆107Updated last year
Alternatives and similar repositories for awesome_role_of_small_models
Users that are interested in awesome_role_of_small_models are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆82Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆191Updated last year
- ☆156Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆87Updated 9 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆122Updated last year
- Large language models for document ranking.☆69Updated last week
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆68Updated last year
- ☆152Updated last month
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆109Updated 5 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆108Updated 11 months ago
- Open Implementations of LLM Analyses☆107Updated last year
- This is the official repository for Inheritune.☆115Updated 9 months ago
- Verifiers for LLM Reinforcement Learning☆79Updated 6 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆155Updated 2 years ago
- [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa…☆55Updated 3 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆65Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Updated last year
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆109Updated last month
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆100Updated last month
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆137Updated last month
- minimal GRPO implementation from scratch☆99Updated 8 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆144Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆56Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆101Updated 11 months ago
- ☆50Updated 5 months ago
- ☆98Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆109Updated last year
- Survey of Small Language Models from Penn State, ...☆213Updated last week
- Evaluating LLMs with fewer examples☆167Updated last year