linjh1118 / Llama3-Chinese-ORPO

A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO

☆17

Alternatives and similar repositories for Llama3-Chinese-ORPO

Users who are interested in Llama3-Chinese-ORPO are comparing it to the libraries listed below

cooper12121 / llama3-Chinese
Pre-training Llama3 using Chinese
☆13 Updated last year
ArtificialZeng / llama3_explained
The newest version of Llama3, with source code explained line by line in Chinese
☆22 Updated last year
CrazyBoyM / LLM-Chinese
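(Work in progress) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a representative model-training scenario to guide readers through hands-on LLM fine-tuning.
☆34 Updated last year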
Azure99 / BlossomData
A fluent, scalable, and easy-to-use LLM data processing framework.
☆24 Updated 2 weeks ago
Alannikos / FunGPT
In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…
☆58 Updated 2 months ago
M1n9X / GraphRAG_Lite
☆16 Updated last year
seanzhang-zhichen / Qwen-WisdomVast
Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …
☆18 Updated last year
Dylan9897 / LLM-TextClassification
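Integrates advanced large language models such as Qwen and DeepSeek, supporting both a pure LLM + classification layer mode and an LLM + LoRA + classification layer mode; design and training are modularized with transformers so that components can be adjusted or replaced as needed.
☆13 Updated 4 months ago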
StarRing2022 / R1-Nature
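The simplest reproduction of R1 results on small models, explaining the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments are used to support the claim that, for strong reasoning ability, the process content of "think" reasoning is the core of AGI/ASI.
☆44 Updated 6 months ago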
Minami-su / character_AI_open
Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.
☆134 Updated 7 months ago
multimodal-art-projection / Megatron-LM-NEO
☆40 Updated last year
cooper12121 / llama3-8x8b-MoE
Copies the MLP of Llama3 8 times as 8 experts, creates a randomly initialized router, and adds a load-balancing loss to construct an 8x8b Mo…
☆27 Updated last year
1100111GTH / XG-RAG
An LLM RAG application with support for API calls and voice interaction.
☆11 Updated last year
ClosedCharacter / Peach
We are the first fully commercially usable role-playing large model.
☆40 Updated 11 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33 Updated last year
woshixiaobai2019 / agent-gym
☆39 Updated last month
Tongyi-Zhiwen / QwenLong-L1
☆288 Updated 2 months ago
limafang / tiny-graphrag
☆41 Updated 3 months ago
Bui1dMySea / MemLong
☆94 Updated 8 months ago
MikeGu721 / AgentGroup
☆91 Updated last year
heyblackC / BetterMixture-Top1-Solution
First-place (top 1) solution for the Tianchi algorithm competition "BetterMixture - LLM Data Mixing Challenge"
☆31 Updated last year
SUSTech-IDEA / SUS-Chat
SUS-Chat: Instruction tuning done right
☆49 Updated last year
percent4 / llm_math_solver
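This project covers dataset synthesis, model training, and evaluation for the math problem-solving ability of large models, along with related write-ups.
☆91 Updated 10 months ago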
Lightblues / AgentRE
Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".
☆69 Updated last year
RUC-GSAI / YuLan-Mini
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆207 Updated 2 weeks ago
Neph0s / InCharacter
Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previo…
☆82 Updated 2 months ago
yongzhuo / qwen2-sft
Qwen1.5-SFT (Alibaba), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers) / LoRA (peft) / inference
☆64 Updated last year
jackfsuia / LLM-Data-Cleaner
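Batch-process data with large models; currently supports OCR with various large models, including Tongyi Qianwen (Qwen), Moonshot AI, Baidu PaddleOCR, OpenAI, and LLaVA. Use LLM to generate or clean data for academic use. Support OCR with qwen, m…
☆15 Updated 10 months ago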
chuxin-llm / Chuxin-Embedding
☆27 Updated 9 months ago
Liuziyu77 / Soda
Search, organize, discover anything!
☆48 Updated last year