cooper12121 / llama3-ChineseLinks
pre-training llama3 using chinese
☆13Updated last year
Alternatives and similar repositories for llama3-Chinese
Users that are interested in llama3-Chinese are comparing it to the libraries listed below
Sorting:
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆36Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- accelerate generating vector by using onnx model☆18Updated last year
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week
- Its an open source LLM based on MOE Structure.☆58Updated last year
- ☆94Updated last year
- ☆165Updated last year
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆59Updated 5 months ago
- 介绍docker、docker compose的使用。☆21Updated last year
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆91Updated 2 years ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆68Updated last year
- 一起来养一只拥有专属记忆的AI猫猫吧!☆10Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 10 months ago
- ☆27Updated last year
- ☆13Updated 7 months ago
- ☆16Updated 3 months ago
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆31Updated 4 months ago
- deepseek思维树模式实现☆21Updated 3 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- ☆15Updated last year
- 我们是第一个完全可商用的角色大模型。☆40Updated last year
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Updated 3 weeks ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆40Updated 2 years ago
- 大语言模型训练和服务调研☆36Updated 2 years ago
- llms related stuff , including code, docs☆13Updated 8 months ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Updated 9 months ago
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的 对话,模型大小根据手头的机器决定☆63Updated last year
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆25Updated last year
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Updated last year