lansinuote / Simple_TRLLinks
☆18Updated 9 months ago
Alternatives and similar repositories for Simple_TRL
Users that are interested in Simple_TRL are comparing it to the libraries listed below
Sorting:
- ☆76Updated 8 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆39Updated 11 months ago
- ☆69Updated last year
- ☆110Updated 11 months ago
- ☆83Updated last month
- personal chatgpt☆372Updated 5 months ago
- 怎么训练一个LLM分词器☆149Updated last year
- Inference code for LLaMA models☆121Updated last year
- ChatGLM-6B添加了RLHF的实现,以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成,以及指定context推荐的RLHF的实现☆85Updated last year
- 通义千问的DPO训练☆48Updated 8 months ago
- 使用单个24G显卡,从0开始训练LLM☆54Updated last week
- ☆23Updated last year
- Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...☆71Updated last month
- llm & rl☆134Updated last week
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆31Updated 10 months ago
- 一些 LLM 方面的从零复现笔记☆200Updated last month
- ☆43Updated 9 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆61Updated last year
- llama2 finetuning with deepspeed and lora☆174Updated last year
- 阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理☆104Updated last year
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆123Updated 6 months ago
- ☆71Updated last week
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆62Updated 3 months ago
- ☆76Updated 9 months ago
- 大模型基础学习和面试八股文☆122Updated last year
- baichuan LLM surpervised finetune by lora☆63Updated last year
- ☆141Updated last year
- 本项目是自动化学报中AUTOPLAN的代码地址,使用大语言模型完成了复杂任务的任务规划以及任务执行☆100Updated 6 months ago
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆118Updated last year
- ☆79Updated 4 months ago