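Llama3-Chinese is a large model built on Meta-Llama-3-8B and trained with the DoRA + LoRA+ methods on 500k high-quality Chinese multi-turn SFT samples, 100k English multi-turn SFT samples, and 2,000 single-turn self-cognition samples.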
☆299Apr 23, 2024Updated 2 years ago
Alternatives and similar repositories for llama3-chinese
Users that are interested in llama3-chinese are comparing it to the libraries listed below.
- Llama3 Chinese post-trained version☆4,152Feb 21, 2026Updated 3 months ago
- This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.☆318May 6, 2024Updated 2 years ago
- ☆343Jul 27, 2024Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆17Apr 12, 2024Updated 2 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.☆14Sep 1, 2025Updated 8 months ago
- ssc-FinLLM: a financial large language model☆27Apr 22, 2024Updated 2 years ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆508May 10, 2024Updated 2 years ago
- ☆22Jul 1, 2024Updated last year
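- Llama Chinese community: aggregates the latest Llama learning resources in real time and builds the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use☆14,712Apr 6, 2025Updated last year
- Chinese Llama & Alpaca LLM project, phase 3 (Chinese Llama-3 LLMs), developed from Meta Llama 3☆1,977Apr 19, 2026Updated last month
- This project uses the Firefly training framework to fine-tune a LLAMA-2 model on the multiple-choice machine reading comprehension task (Multiple Choice MRC), achieving significant improvement.☆11Sep 16, 2023Updated 2 years ago
- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO☆16Apr 24, 2024Updated 2 years ago
- Repository of Chinese post-trained Phi3 models☆324Nov 27, 2024Updated last year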
- Sidekick is an AI powered tool that uses the OpenAI API and GPT-4 model for thinking, exploring ideas, problem-solving, knowledge-buildin…☆42Aug 29, 2024Updated last year
- Conversational recommender system demo☆12Sep 14, 2021Updated 4 years ago
- parse partial json string☆18Nov 16, 2023Updated 2 years ago
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, …☆6,643Oct 24, 2024Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆71,697Updated this week
- Chinese LLaMA-2 & Alpaca-2 LLM project, phase 2, with 64K long-context models☆7,139Apr 19, 2026Updated last month
- Repository of DISC-MedLLM, a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful me…☆565Oct 28, 2023Updated 2 years ago
- An o1-like thinking process☆13Oct 8, 2024Updated last year
- Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)☆652Aug 17, 2024Updated last year
- This repo contains my settings for using a local LLM with graphrag, plus a UI to chat with the index result☆16Jul 24, 2024Updated last year
- A lightweight memo-taking app built with Bun + Hono + MongoDB☆11Jul 23, 2025Updated 10 months ago
- Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)☆18,940Apr 19, 2026Updated last month
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆287Mar 15, 2025Updated last year
- A virtual try-on method based on stable-diffusion☆11Apr 27, 2024Updated 2 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- An OpenAI-compatible API that integrates LLM, Embedding, and Reranker.☆18Aug 21, 2025Updated 9 months ago
- bisheng-unstructured library☆58May 20, 2025Updated last year
- Question and Answer based on Anything.☆14,001Mar 24, 2025Updated last year
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,138Updated this week
- Reformatted Alignment☆111Sep 23, 2024Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,218May 22, 2026Updated last week
- mnn asr demo.☆27Mar 24, 2025Updated last year
- Workflow Defined Engine☆25Nov 4, 2025Updated 6 months ago
- ChatGLM3 series: Open Bilingual Chat LLMs☆13,695Jan 13, 2025Updated last year
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- The official Meta Llama 3 GitHub site☆29,288Jan 26, 2025Updated last year