viitrix / vt-transformer
Transformer framework for edge computing based on C++.
☆124Updated 3 months ago
Alternatives and similar repositories for vt-transformer:
Users who are interested in vt-transformer are comparing it to the repositories listed below:
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆233Updated last week
- ☆309Updated 2 months ago
- ☆107Updated 10 months ago
- OpenLLaMA-Chinese, permissively licensed open-source instruction-following models based on OpenLLaMA☆66Updated last year
- ☆206Updated this week
- Run ChatGLM2-6B on BM1684X☆49Updated 11 months ago
- C++ implementation of Qwen-LM☆577Updated 2 months ago
- ☆39Updated 3 months ago
- ☆90Updated last year
- llama2.c-zh, a small language model that supports Chinese-language scenarios☆145Updated 11 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in Apps☆60Updated 7 months ago
- llm-export can export LLM models to ONNX.☆264Updated last month
- GLM Series Edge Models☆128Updated this week
- Phi3 Chinese repository☆323Updated 2 months ago
- LLM deployment project based on ONNX.☆32Updated 4 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆135Updated 10 months ago
- An easy-to-use framework for modular RAG☆320Updated this week
- Efficient AI Inference & Serving☆467Updated last year
- Qwen2 and Llama3 C++ implementation☆40Updated 8 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆214Updated last week
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆440Updated 4 months ago
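- LLM101n: Let's build a Storyteller (Chinese edition)☆124Updated 6 months ago
- A high-performance, highly extensible, easy-to-use framework for AI applications. It provides AI application developers with a unified, high-performance, easy-to-use programming framework for quickly building AI industry applications across device, edge, and cloud on top of full-stack AI services; supports GPU, …☆146Updated 8 months ago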
- Mixture-of-Experts (MoE) Language Model☆184Updated 5 months ago
- Video classification annotation and video spatio-temporal annotation☆34Updated last year
- Serving Inside PyTorch☆155Updated this week
- Explore LLM model deployment based on AXera's AI chips☆82Updated last week
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆36Updated 2 months ago
- ☆225Updated 9 months ago
- An open-source LLM based on a Mixture-of-Experts (MoE) architecture.☆58Updated 7 months ago