chunhuizhang / deeplearning-envsLinks
深度学习软硬件配置(小白向)
☆34Updated last week
Alternatives and similar repositories for deeplearning-envs
Users that are interested in deeplearning-envs are comparing it to the libraries listed below
Sorting:
- qwen models finetuning☆104Updated 7 months ago
- 介绍docker、docker compose的使用。☆21Updated last year
- 大型语言模型实战指南:应用实践与场景落地☆79Updated last year
- accelerate generating vector by using onnx model☆18Updated last year
- share data, prompt data , pretraining data☆36Updated last year
- 模型压缩的小白入门教程☆22Updated last year
- (1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。☆45Updated 2 years ago
- ChatGLM2-6B-Explained☆36Updated 2 years ago
- Imitate OpenAI with Local Models☆88Updated last year
- 演示Gemma中文指令微调的教程☆46Updated last year
- 千问14B和7B的逐行解释☆62Updated 2 years ago
- ☆106Updated 2 years ago
- 大模型智能体Agent中文教程,博客代码仓库☆39Updated this week
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆48Updated this week
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated 2 years ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- pytorch分布式训练☆69Updated 2 years ago
- A more efficient GLM implementation!☆54Updated 2 years ago
- Ziya-LLaMA-13B是IDEA基于LLaMa的130亿参数的大规模预训练模型,具备翻译,编程,文本分类,信息抽取,摘要,文案生成,常识问答和数学计算等能力。目前姜子牙通用大模型已完成大规模预训练、多任务有监督微调和人类反馈学习三阶段的训练过程。本文主要用于Ziya-…☆45Updated 2 years ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆110Updated last month
- AFAC2024金融智能创新大赛☆56Updated 10 months ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated 2 years ago
- deep learning☆148Updated 5 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆68Updated last year
- Evaluation for AI apps and agent☆43Updated last year
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆96Updated last year
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆90Updated 2 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68Updated 2 years ago