sugarandgugu / Simple-Trl-Training
Fine-tune large language models with the DPO algorithm; simple and easy to get started.
☆28 Updated 4 months ago
Related projects
Alternatives and complementary repositories for Simple-Trl-Training
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ☆67 Updated last week
- ☆120 Updated 7 months ago
- ☆88 Updated 4 months ago
- Train an LLM from scratch on a single 24 GB GPU ☆49 Updated 3 weeks ago
- ☆71 Updated 10 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc ☆155 Updated 8 months ago
- ☆91 Updated 11 months ago
- Simple and efficient multi-GPU fine-tuning of large models with deepspeed + trainer ☆116 Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models ☆53 Updated 5 months ago
- Alibaba Tianchi: 2023 Global Intelligent Vehicle AI Challenge, Track 1: AI large-model retrieval QA, baseline 80+ ☆76 Updated 10 months ago
- Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed) ☆69 Updated 9 months ago
- Fine-tuning of llama, chatglm, and other models ☆82 Updated 4 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark ☆99 Updated last year
- How to train an LLM tokenizer ☆129 Updated last year
- ☆63 Updated last year
- A collection for math word problem (MWP) works, including datasets, algorithms and so on. ☆36 Updated 5 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆132 Updated 4 months ago
- ☆53 Updated 4 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation" ☆130 Updated 3 months ago
- [SIGIR'24] The official implementation code of MOELoRA. ☆124 Updated 3 months ago
- ☆93 Updated 8 months ago
- Instruction-tuning tool for large language models (supports FlashAttention) ☆166 Updated 10 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆109 Updated 5 months ago
- LAiW: A Chinese Legal Large Language Models Benchmark ☆72 Updated 4 months ago
- Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation ☆37 Updated 5 months ago
- ☆52 Updated 2 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve … ☆27 Updated 3 months ago
- ☆129 Updated 4 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus ☆173 Updated 2 weeks ago
- Official github repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension ☆23 Updated 8 months ago