chunhuizhang / deeplearning-envsLinks
深度学习软硬件配置(小白向)
☆30Updated 2 weeks ago
Alternatives and similar repositories for deeplearning-envs
Users that are interested in deeplearning-envs are comparing it to the libraries listed below
Sorting:
- accelerate generating vector by using onnx model☆17Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- AFAC2024金融智能创新大赛☆37Updated 6 months ago
- ☆41Updated 2 months ago
- BLOOM 模型的指令微调☆24Updated last year
- 模型压缩的小白入门教程☆22Updated 11 months ago
- ChatGLM2-6B-Explained☆35Updated last year
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆46Updated last week
- Tutorial for Ray☆25Updated last year
- (1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。☆44Updated last year
- 2023全球智能汽车AI挑战赛——赛道一:AI大模型检索问答, 75+ baseline☆57Updated last year
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- 大型语言模型实战指南:应用实践与场 景落地☆71Updated 8 months ago
- pytorch分布式训练☆66Updated last year
- A more efficient GLM implementation!☆55Updated 2 years ago
- A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.☆24Updated 6 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆27Updated last year
- ☆14Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆9Updated last year
- my notebook !☆37Updated 6 years ago
- 机器学习基础☆8Updated 6 years ago
- TensorRT☆11Updated 4 years ago
- 最少使用 3090 即可训练自己的比特大脑(miniLLM)🧠(进行中). Train your own BitBrain(A mini LLM) with just an RTX 3090 minimum.☆19Updated last week
- qwen models finetuning☆98Updated 2 months ago
- 大语言模型训练和服务调研☆37Updated last year
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆63Updated last year
- 演示Gemma中文指令微调的教程☆46Updated last year
- Another ChatGLM2 implementation for GPTQ quantization☆53Updated last year
- 百度QA100万数据集☆47Updated last year
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year