Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。
☆297Apr 23, 2024Updated last year
Alternatives and similar repositories for llama3-chinese
Users that are interested in llama3-chinese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,158Feb 21, 2026Updated last month
- This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.☆318May 6, 2024Updated last year
- ☆346Jul 27, 2024Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆13Sep 1, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ssc-FinLLM-金融大模型☆27Apr 22, 2024Updated last year
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆510May 10, 2024Updated last year
- Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用☆14,740Apr 6, 2025Updated 11 months ago
- 中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3☆1,974Sep 23, 2024Updated last year
- 本项目采用Firefly模型训练框架,使用LLAMA-2模型对多项选择阅读理解任务(Multiple Choice MRC)进行微调,取得了显著的进步。☆11Sep 16, 2023Updated 2 years ago
- Phi3 中文后训练模型仓库☆325Nov 27, 2024Updated last year
- Sidekick is an AI powered tool that uses the OpenAI API and GPT-4 model for thinking, exploring ideas, problem-solving, knowledge-buildin…☆40Aug 29, 2024Updated last year
- 对话推荐系统展示☆12Sep 14, 2021Updated 4 years ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆69,106Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,652Oct 24, 2024Updated last year
- 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)☆7,161Jul 15, 2025Updated 8 months ago
- Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful me…☆562Oct 28, 2023Updated 2 years ago
- 一个类似于o1的思维过程☆13Oct 8, 2024Updated last year
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆653Aug 17, 2024Updated last year
- This repo is my settings for using the local LLM with graphrag & an UI to chat with the index result☆16Jul 24, 2024Updated last year
- 一款轻量 Memo 记录程序,基于 Bun + Hono + MongoDB 构建☆11Jul 23, 2025Updated 8 months ago
- 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)☆18,963Jul 15, 2025Updated 8 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆285Mar 15, 2025Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- bisheng-unstructured library☆58May 20, 2025Updated 10 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆31Mar 12, 2024Updated 2 years ago
- Question and Answer based on Anything.☆13,906Mar 24, 2025Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, …☆13,391Updated this week
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,107Updated this week
- Reformatted Alignment☆111Sep 23, 2024Updated last year
- mnn asr demo.☆26Mar 24, 2025Updated last year
- An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API☆18Aug 21, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Workflow Defined Engine☆25Nov 4, 2025Updated 4 months ago
- ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型☆13,742Jan 13, 2025Updated last year
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- The official Meta Llama 3 GitHub site☆29,298Jan 26, 2025Updated last year
- Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain…☆37,662Nov 10, 2025Updated 4 months ago
- Using GPT to parse PDF☆3,561Apr 17, 2025Updated 11 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Apr 19, 2024Updated last year