This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.
☆319May 6, 2024Updated last year
Alternatives and similar repositories for Llama3-Chinese-Chat
Users that are interested in Llama3-Chinese-Chat are comparing it to the libraries listed below
Sorting:
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆296Apr 23, 2024Updated last year
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL☆24Oct 30, 2023Updated 2 years ago
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,154Feb 21, 2026Updated 2 weeks ago
- ☆19Mar 5, 2025Updated last year
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆40Oct 30, 2023Updated 2 years ago
- ☆14Dec 19, 2024Updated last year
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆35Sep 12, 2024Updated last year
- 中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3☆1,969Sep 23, 2024Updated last year
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- ☆35Dec 2, 2025Updated 3 months ago
- ☆17Aug 9, 2023Updated 2 years ago
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆14Aug 12, 2024Updated last year
- Official implementation of Dynamic Perceiver☆43Nov 16, 2023Updated 2 years ago
- Phi3 中文后训练模型仓库☆324Nov 27, 2024Updated last year
- Jittor implementation of Vision Transformer with Deformable Attention☆32Mar 1, 2022Updated 4 years ago
- ☆53Jan 2, 2025Updated last year
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆47Jun 13, 2024Updated last year
- accelerate generating vector by using onnx model☆18Jan 23, 2024Updated 2 years ago
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆32Sep 30, 2024Updated last year
- ☆37Jan 18, 2023Updated 3 years ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆512May 10, 2024Updated last year
- [ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks☆38Sep 28, 2023Updated 2 years ago
- ☆11Aug 29, 2025Updated 6 months ago
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29May 10, 2024Updated last year
- SCOPE ICLR 2025☆23Oct 3, 2025Updated 5 months ago
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks☆25Aug 21, 2023Updated 2 years ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用☆14,746Apr 6, 2025Updated 11 months ago
- ☆10Dec 29, 2023Updated 2 years ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆40Nov 5, 2023Updated 2 years ago
- [CVPR 2022] Official repository of AdaFocusV2.☆91Dec 15, 2024Updated last year
- [CVPR 2026] Official repository of Vision Test-Time Training☆55Feb 21, 2026Updated 2 weeks ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- 一個使用Streamlit框架和GPT3.5 turbo模型官方API,快速建置Web app於平台Render。☆12Apr 3, 2023Updated 2 years ago
- ☆12Nov 23, 2025Updated 3 months ago
- ☆31Feb 23, 2025Updated last year
- [NeurIPS 2023] Rank-DETR for High Quality Object Detection☆105Oct 19, 2023Updated 2 years ago