chunhuizhang / llms_tuningLinks

stay tuned.

☆16

Alternatives and similar repositories for llms_tuning

Users that are interested in llms_tuning are comparing it to the libraries listed below

Sorting:

yuanzhoulvpi2017 / SentenceEmbedding
☆111Updated last year
chunhuizhang / bert_t5_gpt
☆73Updated last month
akaihaoshuai / baby-llama2-chinese_cybertron
使用单个24G显卡，从0开始训练LLM
☆56Updated last week
Glanvery / LLM-Travel
欢迎来到 "LLM-travel" 仓库！探索大语言模型（LLM）的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
☆328Updated 11 months ago
yuanzhoulvpi2017 / nano_rl
在verl上做reward的定制开发
☆70Updated last month
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆151Updated 2 years ago
percent4 / llm_math_solver
本项目用于大模型数学解题能力方面的数据集合成，模型训练及评测，相关文章记录。
☆91Updated 10 months ago
suu990901 / LLaMA-MiLe-Loss
Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
☆64Updated 4 months ago
sugarandgugu / Simple-Trl-Training
基于DPO算法微调语言大模型，简单好上手。
☆40Updated last year
chunhuizhang / personal_chatgpt
personal chatgpt
☆377Updated 7 months ago
hengjiUSTC / learn-llm
☆111Updated 8 months ago
chunhuizhang / llm_rl
llm & rl
☆156Updated this week
lansinuote / Simple_LLM_DPO
☆71Updated last year
km1994 / llms_paper
该仓库主要记录 LLMs 算法工程师相关的顶会论文研读笔记（多模态、PEFT、小样本QA问答、RAG、LMMs可解释性、Agents、CoT）
☆343Updated last year
jiahe7ay / MINI_LLM
This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
☆452Updated 2 months ago
Mxoder / LLM-from-scratch
一些 LLM 方面的从零复现笔记
☆205Updated 2 months ago
taishan1994 / Llama3.1-Finetuning
对llama3进行全参微调、lora微调以及qlora微调。
☆202Updated 9 months ago
firechecking / CleanTransformer
an implementation of transformer, bert, gpt, and diffusion models for learning purposes
☆155Updated 9 months ago
pengr / LLM-Synthetic-Data
A live reading list for LLM-synthetic-data.
☆307Updated last week
Pillars-Creation / ChatGLM-RLHF-LoRA-RM-PPO
ChatGLM-6B添加了RLHF的实现，以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成，以及指定context推荐的RLHF的实现
☆86Updated last year
lansinuote / Simple_TRL
☆19Updated 11 months ago
CASIA-LM / MoDS
☆142Updated last year
yang19527 / AwesomeInterview
包含程序员面试大厂面试题和面试经验
☆141Updated last month
tianyi-lab / Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆378Updated 3 weeks ago
nuochenpku / Awesome-Role-Play-Papers
Awesome papers for role-playing with language models
☆194Updated 8 months ago
cwxndl / LLM
大语言模型应用：RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛
☆64Updated 5 months ago
zejunwang1 / LLMTuner
大语言模型指令调优工具（支持 FlashAttention）
☆174Updated last year
dawoshi / Tianchi-LLM-QA
阿里天池: 2023全球智能汽车AI挑战赛——赛道一：AI大模型检索问答 baseline 80+
☆107Updated last year
git-cloner / llama2-lora-fine-tuning
llama2 finetuning with deepspeed and lora
☆175Updated last year
zhangzhao219 / WSDM-Cup-2024
1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc
☆160Updated last year