Baichuan2代码的逐行解析版本,适合小白
☆211Sep 20, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan2-Explained
Users that are interested in Baichuan2-Explained are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- baichuan LLM surpervised finetune by lora☆63Jun 28, 2023Updated 2 years ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,102Nov 8, 2024Updated last year
- Baichuan-13B 指令微调☆88Jul 14, 2023Updated 2 years ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆42Aug 16, 2023Updated 2 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,931Sep 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆15Nov 11, 2024Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,658Jul 18, 2024Updated last year
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆1,315Dec 14, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,643Oct 24, 2024Updated last year
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated 2 years ago
- 语言模型中文认知能力分析☆236Sep 9, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆22Jun 22, 2023Updated 2 years ago
- SIGIR 2022 CODE☆10Apr 1, 2022Updated 4 years ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆16Sep 25, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆164Apr 17, 2023Updated 3 years ago
- 🩹Editing large language models within 10 seconds⚡☆1,364Aug 13, 2023Updated 2 years ago
- ☆197Feb 6, 2025Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆158Jul 25, 2025Updated 10 months ago
- ☆19Mar 22, 2024Updated 2 years ago
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Aug 15, 2023Updated 2 years ago
- 怎么训练一个LLM分词器☆152Jul 13, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- 从socket开始实现pop3和smtp客户端,实现邮件编写、发送、接收、阅读、删除等基本功能。并实现简单界面(PyQt5)Start from socket to implement pop3 and smtp clients, to realize the basic …☆12Dec 24, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆642Apr 9, 2024Updated 2 years ago
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,796Dec 12, 2023Updated 2 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,724Oct 12, 2023Updated 2 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- 演示Gemma中文指令微调的教程☆45Feb 26, 2024Updated 2 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,281Oct 16, 2024Updated last year
- Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用☆14,712Apr 6, 2025Updated last year
- chatglm 6b finetuning and alpaca finetuning☆1,535Mar 9, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆40Jul 15, 2025Updated 10 months ago
- 高性能文本 Tokenizer 库☆31Feb 2, 2024Updated 2 years ago
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,191May 3, 2025Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆62Oct 23, 2024Updated last year
- deep training task☆30Apr 28, 2023Updated 3 years ago
- MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。☆5,428Apr 28, 2026Updated last month
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆255Aug 1, 2023Updated 2 years ago