humza909 / LLM_Survey
☆86Updated last year
Alternatives and similar repositories for LLM_Survey:
Users that are interested in LLM_Survey are comparing it to the libraries listed below
- A Survey on Data Selection for Language Models☆218Updated 5 months ago
- Direct Preference Optimization from scratch in PyTorch☆87Updated last year
- Notes and commented code for RLHF (PPO)☆74Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆453Updated 11 months ago
- ☆83Updated 2 weeks ago
- LLM hallucination paper list☆309Updated last year
- ☆253Updated last year
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆165Updated 3 months ago
- A curated list of Large Language Model with RAG☆79Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆177Updated 11 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆536Updated 3 months ago
- A curated paper list on LLM reasoning.☆84Updated last year
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆469Updated last month
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆100Updated 5 months ago
- Notes on Direct Preference Optimization☆18Updated 10 months ago
- Survey of Small Language Models from Penn State, ...☆164Updated last month
- ☆261Updated 7 months ago
- ☆81Updated 5 months ago
- ☆253Updated last week
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆77Updated last month
- The official evaluation suite and dynamic data release for MixEval.☆231Updated 4 months ago
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.☆135Updated 5 months ago
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"☆334Updated last year
- a curated list of the role of small models in the LLM era☆93Updated 5 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆159Updated this week
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆232Updated last year
- Fantastic Data Engineering for Large Language Models☆81Updated 2 months ago
- An Extensible Continual Learning Framework Focused on Language Models (LMs)☆269Updated last year
- Must-read Papers on Large Language Model (LLM) Continual Learning☆141Updated last year
- An assignment for building an NLP system from scratch.☆24Updated last year