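Llama3-Chinese is a large language model built on the Meta-Llama-3-8B base and trained with the DoRA + LoRA+ method on 500k high-quality Chinese multi-turn SFT samples, 100k English multi-turn SFT samples, and 2,000 single-turn self-cognition samples.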
☆299 · Apr 23, 2024 · Updated 2 years ago
Alternatives and similar repositories for llama3-chinese
Users that are interested in llama3-chinese are comparing it to the libraries listed below.
- Llama3 Chinese post-trained version ☆4,152 · Feb 21, 2026 · Updated 3 months ago
- This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model. ☆317 · May 6, 2024 · Updated 2 years ago
- ☆343 · Jul 27, 2024 · Updated last year
- 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX. ☆14 · Sep 1, 2025 · Updated 9 months ago
- ssc-FinLLM: a financial large language model ☆27 · Apr 22, 2024 · Updated 2 years ago
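- Llama3-Tutorial (XTuner, LMDeploy, OpenCompass) ☆508 · May 10, 2024 · Updated 2 years ago
- ☆22 · Jul 1, 2024 · Updated last year
- Llama Chinese community: aggregates the latest Llama learning resources in real time and aims to build the best open-source ecosystem for Chinese Llama LLMs; fully open source and commercially usable ☆14,713 · Apr 6, 2025 · Updated last year
- Chinese Alpaca LLM project, phase 3 (Chinese Llama-3 LLMs), developed from Meta Llama 3 ☆1,975 · Apr 19, 2026 · Updated 2 months ago
- This project uses the Firefly training framework to fine-tune the LLAMA-2 model on the multiple-choice machine reading comprehension task (Multiple Choice MRC), achieving significant improvement. ☆11 · Sep 16, 2023 · Updated 2 years ago
- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO ☆16 · Apr 24, 2024 · Updated 2 years ago
- Repository of Chinese post-trained Phi3 models ☆325 · Nov 27, 2024 · Updated last year
- Sidekick is an AI powered tool that uses the OpenAI API and GPT-4 model for thinking, exploring ideas, problem-solving, knowledge-buildin… ☆42 · Aug 29, 2024 · Updated last year
- Conversational recommender system demo ☆12 · Sep 14, 2021 · Updated 4 years ago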
- Firefly: an LLM training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, … ☆6,641 · Oct 24, 2024 · Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆72,107 · Jun 10, 2026 · Updated last week
- Chinese LLaMA-2 & Alpaca-2 LLM project, phase 2, plus 64K long-context models (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models) ☆7,137 · Apr 19, 2026 · Updated 2 months ago
- Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful me… ☆565 · Oct 28, 2023 · Updated 2 years ago
- An o1-like thinking process ☆13 · Oct 8, 2024 · Updated last year
- Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B) ☆651 · Aug 17, 2024 · Updated last year
- This repo is my settings for using the local LLM with graphrag & an UI to chat with the index result ☆16 · Jul 24, 2024 · Updated last year
- A lightweight memo-taking app built with Bun + Hono + MongoDB ☆11 · Jul 23, 2025 · Updated 10 months ago
- Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs) ☆18,946 · Apr 19, 2026 · Updated 2 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models ☆288 · Mar 15, 2025 · Updated last year
- A virtual try-on method based on stable-diffusion ☆11 · Apr 27, 2024 · Updated 2 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi ☆14 · Jan 4, 2023 · Updated 3 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆31 · Mar 12, 2024 · Updated 2 years ago
- Question and Answer based on Anything. ☆14,012 · Mar 24, 2025 · Updated last year
- A Next-Generation Training Engine Built for Ultra-Large MoE Models ☆5,150 · Updated this week
- Reformatted Alignment ☆111 · Sep 23, 2024 · Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL… ☆14,496 · Updated this week
- mnn asr demo. ☆27 · Mar 24, 2025 · Updated last year
- Workflow Defined Engine ☆25 · Nov 4, 2025 · Updated 7 months ago
- An LLM fine-tuned from the InternLM2-chat-7 base model on a dataset of common Chinese herbal medicines and related material. A traditional Chinese medicine chat assistant. ☆18 · Feb 29, 2024 · Updated 2 years ago
- ChatGLM3 series: Open Bilingual Chat LLMs ☆13,677 · Jan 13, 2025 · Updated last year
- UIE (Universal Information Extraction) inference with ncnn ☆15 · Sep 22, 2024 · Updated last year
- The official Meta Llama 3 GitHub site ☆29,284 · Jan 26, 2025 · Updated last year
- Automated content cross posting from Notion Database to Dev.to, Hashnode, Medium, Twitter, and LinkedIn using GitHub Actions. ☆13 · Oct 21, 2024 · Updated last year
- Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama ☆38,167 · Nov 10, 2025 · Updated 7 months ago