This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.
☆318May 6, 2024Updated last year
Alternatives and similar repositories for Llama3-Chinese-Chat
Users that are interested in Llama3-Chinese-Chat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆41Oct 30, 2023Updated 2 years ago
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL☆24Oct 30, 2023Updated 2 years ago
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆299Apr 23, 2024Updated last year
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,163Feb 21, 2026Updated last month
- ☆14Dec 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆35Sep 12, 2024Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆15Aug 12, 2024Updated last year
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆47Jun 13, 2024Updated last year
- ☆18Aug 9, 2023Updated 2 years ago
- Official implementation of Dynamic Perceiver☆43Nov 16, 2023Updated 2 years ago
- 中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3☆1,970Sep 23, 2024Updated last year
- ☆37Jan 18, 2023Updated 3 years ago
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆31Sep 30, 2024Updated last year
- ☆32Feb 23, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- [ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks☆38Sep 28, 2023Updated 2 years ago
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks☆25Aug 21, 2023Updated 2 years ago
- ☆16Apr 12, 2024Updated 2 years ago
- [CVPR 2022] Official repository of AdaFocusV2.☆91Dec 15, 2024Updated last year
- [NeurIPS 2023] Rank-DETR for High Quality Object Detection☆105Oct 19, 2023Updated 2 years ago
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition☆53Mar 20, 2025Updated last year
- ☆15Aug 4, 2025Updated 8 months ago
- Official repository of Uni-AdaFocus (TPAMI 2024).☆61Dec 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用☆14,734Apr 6, 2025Updated last year
- Phi3 中文后训练模型仓库☆324Nov 27, 2024Updated last year
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆510May 10, 2024Updated last year
- [IEEE TIP] Fine-grained Recognition with Learnable Semantic Data Augmentation☆31Dec 23, 2023Updated 2 years ago
- Repository of GridMix (ICLR 2025)☆36Mar 18, 2025Updated last year
- [ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection☆142Mar 15, 2025Updated last year
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- [EMNLP 2024] A Video Chat Agent with Temporal Prior☆32Mar 2, 2025Updated last year
- ☆14Dec 16, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆70,203Apr 12, 2026Updated last week
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆47Sep 11, 2024Updated last year
- Retrieval and Retrieval-augmented LLMs☆11,537Apr 1, 2026Updated 2 weeks ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆27Jan 4, 2026Updated 3 months ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华 中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆40Nov 5, 2023Updated 2 years ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆7,078Jul 4, 2025Updated 9 months ago
- 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)☆7,153Updated this week