viitrix / vt-transformerLinks
Transformer framework for edge computing based on C++.
☆129Updated last year
Alternatives and similar repositories for vt-transformer
Users that are interested in vt-transformer are comparing it to the libraries listed below
Sorting:
- run ChatGLM2-6B in BM1684X☆50Updated last year
- ☆113Updated last year
- ☆337Updated last month
- qwen2 and llama3 cpp implementation☆48Updated last year
- C++ implementation of Qwen-LM☆609Updated 11 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆267Updated 3 months ago
- 支持中文场景的的小语言模型 llama2.c-zh☆150Updated last year
- A high performance, high expansion, easy to use framework for AI application. 为AI应用的开发者提供一套统一的高性能、易用的编程框架,快速基于AI全栈服务、开发跨端边云的AI行业应用,支持GPU,…☆160Updated last year
- GLM Series Edge Models☆153Updated 5 months ago
- Explore LLM model deployment based on AXera's AI chips☆126Updated last week
- Port of Facebook's LLaMA model in C/C++☆103Updated 3 weeks ago
- OpenLLaMA-Chinese, a permissively licensed open source instruction-following models based on OpenLLaMA☆66Updated 2 years ago
- Serving Inside Pytorch☆165Updated last week
- Efficient AI Inference & Serving☆478Updated last year
- ☆52Updated last year
- llm-export can export llm model to onnx.☆330Updated last month
- ☆178Updated this week
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆86Updated last year
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆295Updated last year
- ☆347Updated last year
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆79Updated last year
- ☆65Updated this week
- ☆79Updated last year
- ☆240Updated 9 months ago
- OrionStar-Yi-34B-Chat 是一款开源中英文Chat模型,由猎户星空基于Yi-34B开源模型、使用15W+高质量语料微调而成。☆262Updated last year
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆89Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- An easy-to-use framework for modular RAG☆410Updated this week
- LLM101n: Let's build a Storyteller 中文版☆135Updated last year
- Run generative AI models in sophgo BM1684X/BM1688☆254Updated this week