heyblackC / BetterMixture-Top1-SolutionLinks

天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案

☆31

Alternatives and similar repositories for BetterMixture-Top1-Solution

Users that are interested in BetterMixture-Top1-Solution are comparing it to the libraries listed below

Sorting:

muyaostudio / qwen2_seq_cls
使用 Qwen2ForSequenceClassification 简单实现文本分类任务。
☆73Updated last year
cwxndl / LLM
大语言模型应用：RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛
☆66Updated 5 months ago
akaihaoshuai / baby-llama2-chinese_cybertron
使用单个24G显卡，从0开始训练LLM
☆56Updated last month
sugarandgugu / Simple-Trl-Training
基于DPO算法微调语言大模型，简单好上手。
☆42Updated last year
yuanzhoulvpi2017 / SentenceEmbedding
☆113Updated last year
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆151Updated 2 years ago
yongzhuo / qwen2-sft
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆64Updated last year
ZBayes / poc_project
通用简单工具项目
☆20Updated 10 months ago
owenliang / qwen-dpo
通义千问的DPO训练
☆51Updated 10 months ago
poisonwine / Tianchi-LLM-retrieval
2023全球智能汽车AI挑战赛——赛道一：AI大模型检索问答， 75+ baseline
☆60Updated last year
percent4 / llm_math_solver
本项目用于大模型数学解题能力方面的数据集合成，模型训练及评测，相关文章记录。
☆91Updated 10 months ago
shyoulala / Kaggle_Eedi_2024_sayoulala
kaggle 2024 Eedi 第10名金牌方案
☆36Updated 7 months ago
826568389 / GRPO-R1
☆13Updated 4 months ago
zejunwang1 / LLMTuner
大语言模型指令调优工具（支持 FlashAttention）
☆175Updated last year
Pillars-Creation / ChatGLM-RLHF-LoRA-RM-PPO
ChatGLM-6B添加了RLHF的实现，以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成，以及指定context推荐的RLHF的实现
☆86Updated last year
dawoshi / Tianchi-LLM-QA
阿里天池: 2023全球智能汽车AI挑战赛——赛道一：AI大模型检索问答 baseline 80+
☆110Updated last year
zhangzhao219 / WSDM-Cup-2024
1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc
☆160Updated 2 weeks ago
RUC-GSAI / Llama-3-SynE
Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …
☆34Updated 2 months ago
issaccv / aiops24-RAG-demo
用于AIOPS24挑战赛的Demo
☆62Updated last year
IronBeliever / CaR
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
☆86Updated 8 months ago
xubuvd / LLMs
专注于中文领域大语言模型，落地到某个行业某个领域，成为一个行业大模型、公司级别或行业级别领域大模型。
☆119Updated 5 months ago
CASIA-LM / MoDS
☆144Updated last year
limafang / tiny-graphrag
☆41Updated 2 months ago
CSHaitao / ChatGLM_mutli_gpu_tuning
deepspeed+trainer简单高效实现多卡微调大模型
☆126Updated 2 years ago
hengjiUSTC / learn-llm
☆112Updated 8 months ago
zysNLP / quickllm
A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …
☆46Updated last month
Dylan9897 / LLM-TextClassification
集成Qwen与DeepSeek等先进大语言模型，支持纯LLM+分类层模式及LLM+LoRA+分类层模式，使用transformers模块化设计和训练便于根据需要调整或替换组件。
☆13Updated 4 months ago
Glanvery / LLM-Travel
欢迎来到 "LLM-travel" 仓库！探索大语言模型（LLM）的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
☆331Updated last year
taishan1994 / Llama3.1-Finetuning
对llama3进行全参微调、lora微调以及qlora微调。
☆204Updated 10 months ago
shuyhere / all-about-llm
大语言模型训练和服务调研
☆37Updated 2 years ago