bbruceyuan / bit-brainLinks
最少使用 3090 即可训练自己的比特大脑(miniLLM)🧠(进行中). Train your own BitBrain(A mini LLM) with just an RTX 3090 minimum.
☆19Updated last week
Alternatives and similar repositories for bit-brain
Users that are interested in bit-brain are comparing it to the libraries listed below
Sorting:
- ☆41Updated 2 months ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- accelerate generating vector by using onnx model☆17Updated last year
- 解锁HuggingFace生态的百般用法☆91Updated 5 months ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆75Updated 3 weeks ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆27Updated last year
- 大型语言模型实战指南:应用实践与场景落地☆71Updated 8 months ago
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的对话,模型大小根据手头的机器决定☆59Updated 9 months ago
- 模型压缩的小白入门教程☆22Updated 11 months ago
- ☆22Updated 3 months ago
- Llama3开源模型中文版-全方位测评,基于SuperCLUE基准 | Llama3 Chinese Evaluation with SuperCLUE☆16Updated last year
- 欢迎来到“筱可AI研习社”的实战项目仓库!这个仓库主要用于存储和展示为公众号撰写的各类实战项目。我们会不断优化和迭代这些项目,以探索AI的无限可能。☆43Updated last week
- 通义千问的DPO训练☆48Updated 8 months ago
- from MHA, MQA, GQA to MLA by 苏剑林, with code☆19Updated 3 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆60Updated 9 months ago
- a toolkit on knowledge distillation for large language models☆57Updated this week
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆59Updated last week
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆11Updated 3 months ago
- 百度QA100万数据集☆47Updated last year
- 大语言模型训练和服务调研☆37Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆90Updated last year
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆46Updated last week
- Collection of model-centric MCP servers☆17Updated 2 weeks ago
- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SO…☆82Updated 4 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 5 months ago
- 学习vLLM,使用vLLM部署Qwen2-0.5B的模型,并使用docker部署。☆18Updated 11 months ago
- 最简易的R1结果在小模型上的复现,阐 述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 3 months ago
- 本项目致力于为大模型领域的初学者提供全面的知识体系,包括基础和高阶内容,以便开发者能迅速掌握大模型技术栈并全面了解相关知识。☆59Updated 5 months ago
- 千问14B和7B的逐行解释☆60Updated last year