cooper12121 / llama3-Chinese

Pre-training llama3 on Chinese

☆13

Alternatives and similar repositories for llama3-Chinese

Users who are interested in llama3-Chinese are comparing it to the repositories listed below

linjh1118 / Llama3-Chinese-ORPO
A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO
☆17 Updated last year
ArtificialZeng / llama3_explained
The newest version of llama3, with the source code explained line by line in Chinese
☆22 Updated last year
zysNLP / quickllm
A repo for updating and debugging Mixtral-7x8B, MOE, ChatGLM3, LLaMa2, BaChuan, Qwen and other LLM models, including new models mixtral, mixtral 8x7b, …
☆46 Updated last week
shootime2021 / APUS-xDAN-4.0-moe
It's an open-source LLM based on an MoE structure.
☆58 Updated 11 months ago
yanqiangmiffy / Agent-Tutorials-ZH
Chinese tutorials on LLM agents; code repository for the blog
☆19 Updated this week
HITsz-TMG / YiZhao
YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…
☆26 Updated 6 months ago
ArtificialZeng / Qwen-Explained
Line-by-line explanation of Qwen 14B and 7B
☆60 Updated last year
t6am3 / law_glm_baseline
☆15 Updated last year
infinigence / InfiniWebSearch
A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.
☆38 Updated 6 months ago
CrazyBoyM / LLM-Chinese
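(Work in progress.) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training problem to guide readers through hands-on LLM fine-tuning.
☆34 Updated 10 months ago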
shibing624 / github-hot
Automatically tracks hot GitHub repos, updated daily
☆49 Updated this week
cwxndl / LLM
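Large language model applications: RAG, NL2SQL, chatbots, pre-training, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions
☆62 Updated 4 months ago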
reilxlx / llava-Qwen2-7B-Instruct-Chinese-CLIP
The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves recognition of Chinese text and of the meaning behind memes, approaching the recognition level of gpt4o and claude-3.5-sonnet!
☆23 Updated 11 months ago
QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆135 Updated 6 months ago
ziwang-com / chinese-StableVicuna
The world's first Chinese-optimized version of StableVicuna.
☆64 Updated last year
modelscope / mcp-central
Collection of model-centric MCP servers
☆20 Updated last month
AI-Ceping / LLM-Ceping
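A comprehensive LLM evaluation knowledge base | prompt engineering, LLM leaderboards from various sources, benchmark datasets, safety testing, adversarial attacks, agents, high-quality data, text classification, relation extraction, speech recognition, speech synthesis, multimodal, text-to-image, text-to-video, point clouds, conversational AI, summarization, question answering…
☆63 Updated 7 months ago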
ssbuild / aigc_data
share data, prompt data, pretraining data
☆36 Updated last year
sakharamg / KITLM
☆13 Updated last year
Airmomo / SPO
SPO | Self-Supervised Prompt Optimization
☆25 Updated 3 months ago
taishan1994 / qlora-chinese-LLM
Fine-tuning Chinese large language models with qlora, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE
☆87 Updated 2 years ago
Airmomo / tpo-llm-webui
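TPO is a framework for optimizing LLM output text. It "fine-tunes" the model through iterative feedback and prompt optimization rather than by directly adjusting the model's parameters, aligning the model with human preferences at inference time to produce better results. This project provides a friendly WebUI for loading models, optimizing the base model in real time, and displaying the best result.
☆10 Updated 4 months ago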
360AILABNLP / 360LayoutAnalysis
☆27 Updated 8 months ago
cooper12121 / llama3-8x8b-MoE
Copies the MLP of llama3 8 times as 8 experts, creates a router with random initialization, and adds a load balancing loss to construct an 8x8b Mo…
☆27 Updated 11 months ago
ziwang-com / mini-AGI
A GPT+ power tool: a simple, practical, one-stop AGI architecture with built-in localization, LLM models, agents, a vector database, and chains
☆48 Updated last year
ssbuild / deep_training
deep learning
☆148 Updated last month
thunlp / Delta-CoMe
Delta-CoMe can achieve near-lossless 1-bit compression; it has been accepted by NeurIPS 2024
☆57 Updated 7 months ago
826568389 / GRPO-R1
☆12 Updated 3 months ago
1100111GTH / XG-RAG
An LLM RAG application that supports API calls and voice interaction.
☆11 Updated last year
peilongchencc / docker_tutorial
An introduction to using docker and docker compose.
☆20 Updated 9 months ago