xyjigsaw / LLM-Pretrain-SFT
Scripts for LLM pre-training and fine-tuning (with or without LoRA, DeepSpeed)
☆79, updated last year
Alternatives and similar repositories for LLM-Pretrain-SFT:
Users who are interested in LLM-Pretrain-SFT compare it to the libraries listed below:
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ☆78, updated 5 months ago
- An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆139, updated 10 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ☆251, updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆147, updated 7 months ago
- ☆139, updated last year
- ☆81, updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆57, updated 4 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆80, updated 2 months ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener… ☆60, updated 9 months ago
- ☆143, updated 9 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings ☆153, updated 10 months ago
- ☆97, updated last year
- How to train an LLM tokenizer ☆144, updated last year
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight) ☆127, updated 2 months ago
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs. ☆42, updated 9 months ago
- Collection of training data management explorations for large language models ☆322, updated 8 months ago
- ☆174, updated last year
- ☆107, updated 5 months ago
- ☆94, updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo… ☆358, updated 7 months ago
- Fantastic Data Engineering for Large Language Models ☆87, updated 3 months ago
- ☆123, updated last year
- YuLan-IR: Information Retrieval Boosted LMs ☆218, updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc ☆161, updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆166, updated 10 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆124, updated 10 months ago
- LongQLoRA: Extend Context Length of LLMs Efficiently ☆164, updated last year
- Counting-Stars (★) ☆82, updated 7 months ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat ☆115, updated last year
- ☆157, updated 2 weeks ago