Baichuan2代码的逐行解析版本,适合小白
☆212Sep 20, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan2-Explained
Users that are interested in Baichuan2-Explained are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- baichuan LLM surpervised finetune by lora☆64Jun 28, 2023Updated 2 years ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,113Nov 8, 2024Updated last year
- Baichuan-13B 指令微调☆90Jul 14, 2023Updated 2 years ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆42Aug 16, 2023Updated 2 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,933Sep 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆15Nov 11, 2024Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,671Jul 18, 2024Updated last year
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆1,311Dec 14, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,649Oct 24, 2024Updated last year
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated last year
- 语言模型中文认知能力分析☆235Sep 9, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- SIGIR 2022 CODE☆10Apr 1, 2022Updated 4 years ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆16Sep 25, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆164Apr 17, 2023Updated 3 years ago
- 🩹Editing large language models within 10 seconds⚡☆1,361Aug 13, 2023Updated 2 years ago
- ☆196Feb 6, 2025Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆159Jul 25, 2025Updated 8 months ago
- ☆19Mar 22, 2024Updated 2 years ago
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Aug 15, 2023Updated 2 years ago
- 怎么训练一个LLM分词器☆152Jul 13, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- 从socket开始实现pop3和smtp客户端,实现邮件编写、发送、接收、阅读、删除等基本功能。并实现简单界面(PyQt5)Start from socket to implement pop3 and smtp clients, to realize the basic …☆12Dec 24, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆642Apr 9, 2024Updated 2 years ago
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,798Dec 12, 2023Updated 2 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,723Oct 12, 2023Updated 2 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- 演示Gemma中文指令微调的教程☆45Feb 26, 2024Updated 2 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,287Oct 16, 2024Updated last year
- Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用☆14,734Apr 6, 2025Updated last year
- chatglm 6b finetuning and alpaca finetuning☆1,536Mar 9, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆40Jul 15, 2025Updated 9 months ago
- 高性能文本 Tokenizer 库☆32Feb 2, 2024Updated 2 years ago
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,193May 3, 2025Updated 11 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆62Oct 23, 2024Updated last year
- MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。☆5,212Apr 9, 2026Updated last week
- deep training task☆30Apr 28, 2023Updated 2 years ago
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆256Aug 1, 2023Updated 2 years ago