chunhuizhang / deeplearning-envsLinks

深度学习软硬件配置（小白向）

☆34

Alternatives and similar repositories for deeplearning-envs

Users that are interested in deeplearning-envs are comparing it to the libraries listed below

Sorting:

ssbuild / qwen_finetuning
qwen models finetuning
☆104Updated 7 months ago
peilongchencc / docker_tutorial
介绍docker、docker compose的使用。
☆21Updated last year
liucongg / LLMsBook
大型语言模型实战指南：应用实践与场景落地
☆79Updated last year
amulil / vector_by_onnxmodel
accelerate generating vector by using onnx model
☆18Updated last year
ssbuild / aigc_data
share data， prompt data , pretraining data
☆36Updated last year
ironartisan / awesome-compression1
模型压缩的小白入门教程
☆22Updated last year
lilongxian / BaiYang-chatGLM2-6B
（1）弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练，提高万级tokens性能支持。（2）证据理论解释学习，提升模型的复杂逻辑推理能力（3）兼容alpaca数据格式。
☆45Updated 2 years ago
ArtificialZeng / ChatGLM2-6B-Explained
ChatGLM2-6B-Explained
☆36Updated 2 years ago
the-seeds / imitater
Imitate OpenAI with Local Models
☆88Updated last year
windmaple / Gemma-Chinese-instruction-tuning
演示Gemma中文指令微调的教程
☆46Updated last year
ArtificialZeng / Qwen-Explained
千问14B和7B的逐行解释
☆62Updated 2 years ago
zzlgreat / smart_agent
☆106Updated 2 years ago
yanqiangmiffy / Agent-Tutorials-ZH
大模型智能体Agent中文教程，博客代码仓库
☆39Updated this week
zysNLP / quickllm
A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …
☆48Updated this week
yanqiangmiffy / tree2retriever
Recursive Abstractive Processing for Tree-Organized Retrieval
☆10Updated last year
airaria / GRAIN
GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models
☆19Updated 2 years ago
ArtificialZeng / llama3_explained
the newest version of llama3，source code explained line by line using Chinese
☆22Updated last year
taishan1994 / pytorch-distributed-NLP
pytorch分布式训练
☆69Updated 2 years ago
Oneflow-Inc / one-glm
A more efficient GLM implementation!
☆54Updated 2 years ago
ChaosWang666 / Ziya-LLaMA-13B-deployment
Ziya-LLaMA-13B是IDEA基于LLaMa的130亿参数的大规模预训练模型，具备翻译，编程，文本分类，信息抽取，摘要，文案生成，常识问答和数学计算等能力。目前姜子牙通用大模型已完成大规模预训练、多任务有监督微调和人类反馈学习三阶段的训练过程。本文主要用于Ziya-…
☆45Updated 2 years ago
hyperai / vllm-cn
vLLM Documentation in Chinese Simplified / vLLM 中文文档
☆110Updated last month
AFAC2024 / AFAC2024-Advanced-Fintech-AI-Competition
AFAC2024金融智能创新大赛
☆56Updated 10 months ago
ArtificialZeng / baichuan-speedup
纯c++的全平台llm加速库，支持python调用，支持baichuan, glm, llama, moss基座，手机端流畅运行chatglm-6B级模型单卡可达10000+token / s，
☆45Updated 2 years ago
ssbuild / deep_training
deep learning
☆148Updated 5 months ago
yongzhuo / qwen2-sft
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆68Updated last year
ninehills / langeval
Evaluation for AI apps and agent
☆43Updated last year
billvsme / my_openai_api
部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ，实现了OpenAI中Chat, Models和Completions接口，包含流式响…
☆96Updated last year
taishan1994 / qlora-chinese-LLM
使用qlora对中文大语言模型进行微调，包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE
☆90Updated 2 years ago
seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBi
百川Dynamic NTK-ALiBi的代码实现：无需微调即可推理更长文本
☆49Updated 2 years ago
RapidAI / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆68Updated 2 years ago