Shenzhi-Wang / Llama3-Chinese-ChatView external linksLinks
This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.
☆321May 6, 2024Updated last year
Alternatives and similar repositories for Llama3-Chinese-Chat
Users that are interested in Llama3-Chinese-Chat are comparing it to the libraries listed below
Sorting:
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆296Apr 23, 2024Updated last year
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL☆24Oct 30, 2023Updated 2 years ago
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,151Jan 6, 2026Updated last month
- ☆19Mar 5, 2025Updated 11 months ago
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆40Oct 30, 2023Updated 2 years ago
- ☆14Dec 19, 2024Updated last year
- 中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3☆1,963Sep 23, 2024Updated last year
- ☆35Dec 2, 2025Updated 2 months ago
- ☆17Aug 9, 2023Updated 2 years ago
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆14Aug 12, 2024Updated last year
- Official implementation of Dynamic Perceiver☆43Nov 16, 2023Updated 2 years ago
- Phi3 中文后训练模型仓库☆322Nov 27, 2024Updated last year
- Jittor implementation of Vision Transformer with Deformable Attention☆32Mar 1, 2022Updated 3 years ago
- ☆53Jan 2, 2025Updated last year
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆47Jun 13, 2024Updated last year
- ☆16Apr 12, 2024Updated last year
- ☆37Jan 18, 2023Updated 3 years ago
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆32Sep 30, 2024Updated last year
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆512May 10, 2024Updated last year
- [ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks☆37Sep 28, 2023Updated 2 years ago
- ☆11Aug 29, 2025Updated 5 months ago
- SCOPE ICLR 2025☆22Oct 3, 2025Updated 4 months ago
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29May 10, 2024Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks☆25Aug 21, 2023Updated 2 years ago
- Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用☆14,748Apr 6, 2025Updated 10 months ago
- ☆10Dec 29, 2023Updated 2 years ago
- Official repository of Vision Test-Time Training☆49Dec 7, 2025Updated 2 months ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆40Nov 5, 2023Updated 2 years ago
- [CVPR 2022] Official repository of AdaFocusV2.☆91Dec 15, 2024Updated last year
- 一個使用Streamlit框架和GPT3.5 turbo模型官方API,快速建置Web app於平台Render。☆12Apr 3, 2023Updated 2 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆11Nov 23, 2025Updated 2 months ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated last year
- [Nature Machine Intelligence 2025] Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception☆134Nov 25, 2025Updated 2 months ago
- ☆31Feb 23, 2025Updated 11 months ago
- A mini assistant to help you read paper quickly☆55May 6, 2025Updated 9 months ago
- ☆46May 9, 2025Updated 9 months ago