cooper12121 / llama3-ChineseLinks
pre-training llama3 using chinese
☆13Updated last year
Alternatives and similar repositories for llama3-Chinese
Users that are interested in llama3-Chinese are comparing it to the libraries listed below
Sorting:
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆35Updated last year
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆59Updated 4 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29Updated last year
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆91Updated 2 years ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆69Updated 8 months ago
- llms related stuff , including code, docs☆13Updated 7 months ago
- Its an open source LLM based on MOE Structure.☆58Updated last year
- ☆13Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆39Updated 9 months ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆48Updated last week
- ☆27Updated 11 months ago
- ☆15Updated last year
- 大语言模型训练和服务调研☆36Updated 2 years ago
- 千问14B和7B的逐行解释☆62Updated 2 years ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 8 months ago
- 我们是第一个完全可商用的角色大模型。☆40Updated last year
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆40Updated 2 years ago
- ModelScope+Transformers+SwanLab实现Qwen-1.5-7b的指令微调任务☆23Updated last year
- accelerate generating vector by using onnx model☆17Updated last year
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Updated 9 months ago
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆72Updated 4 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆26Updated 6 months ago
- llama inference for tencentpretrain☆99Updated 2 years ago
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆25Updated last year
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的对话,模型大小根据手头的机器决定☆62Updated last year
- ☆164Updated last year