yinghuo302 / ascend-llm
基于昇腾310芯片的大语言模型部署
☆13Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for ascend-llm
- Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind…☆80Updated 8 months ago
- run ChatGLM2-6B in BM1684X☆48Updated 8 months ago
- ☆26Updated 3 weeks ago
- LLM 推理服务性能测试☆27Updated 11 months ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆44Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆40Updated 3 months ago
- llm-export can export llm model to onnx.☆231Updated last week
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆38Updated 2 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆48Updated 5 months ago
- simplify >2GB large onnx model☆44Updated 8 months ago
- 基于OpenVINO,本地部署大模型智能体Agent,控制TonyPi人形机器人☆81Updated last week
- ☆26Updated this week
- 基于InternLM2大模型的 离线具身智能导盲犬☆66Updated 7 months ago
- export llama to onnx☆98Updated 5 months ago
- ☆90Updated last year
- ☆13Updated last month
- ☆145Updated this week
- simple decoder-only GTP model in pytorch☆32Updated 6 months ago
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆62Updated last week
- 「大模型」3小时从0训练27M参数的视觉多模态VLM,个人显卡即可推理训练!☆376Updated this week
- PaddlePaddle custom device implementaion. (『飞桨』自定义硬件接入实现)☆71Updated this week
- Inference code for LLaMA models☆109Updated last year
- 个人项目地址,一些大语言模型和多模态模型的应用☆123Updated 2 weeks ago
- Run generative AI models in sophgo BM1684X☆126Updated this week
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆52Updated 7 months ago
- This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…☆118Updated 4 months ago
- 本项目是自动化学报中AUTOPLAN的代码地址,使用大语言模型完成了复杂任务的任务规划以及任务执行☆80Updated last week
- ☆22Updated last year
- A toolbox of yolo models and algorithms based on MindSpore☆105Updated this week
- The project is a multi-threaded inference demo of Yolo running on the RK3588 platform, which has been adapted for reading video files and…☆204Updated 2 months ago