chunhuizhang / deeplearning-envs
深度学习软硬件配置(小白向)
☆30Updated 3 months ago
Alternatives and similar repositories for deeplearning-envs:
Users that are interested in deeplearning-envs are comparing it to the libraries listed below
- Recursive Abstractive Processing for Tree-Organized Retrieval☆11Updated 10 months ago
- 大型语言模型实战指南:应用实践与场景落地☆68Updated 6 months ago
- pretrain a wiki llm using transformers☆33Updated 7 months ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- accelerate generating vector by using onnx model☆16Updated last year
- ChatGLM2-6B-Explained☆35Updated last year
- pytorch分布式训练☆65Updated last year
- 模型压缩的小白入门教程☆22Updated 9 months ago
- 大语言模型训练和服务调研☆37Updated last year
- ☆30Updated 3 weeks ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 11 months ago
- Large-scale exact string matching tool☆16Updated last month
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆55Updated 10 months ago
- ☆14Updated last year
- Manages vllm-nccl dependency☆17Updated 10 months ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆43Updated 3 weeks ago
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆56Updated 4 months ago
- 演示Gemma中文指令微调的教程☆46Updated last year
- 2023全球智能汽车AI挑战赛——赛道一:AI大模型检索问答, 75+ baseline☆56Updated last year
- qwen models finetuning☆95Updated 3 weeks ago
- ☆41Updated 3 months ago
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆47Updated 4 months ago
- 用于AIOPS24挑战赛的Demo☆61Updated 9 months ago
- share data, prompt data , pretraining data☆36Updated last year
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆152Updated 5 months ago
- Imitate OpenAI with Local Models☆88Updated 7 months ago
- LoRA☆19Updated last year
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆61Updated last year
- ☆105Updated last year
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated last year