viitrix / vt-transformerLinks
Transformer framework for edge computing based on C++.
☆124Updated 7 months ago
Alternatives and similar repositories for vt-transformer
Users that are interested in vt-transformer are comparing it to the libraries listed below
Sorting:
- ☆109Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆257Updated 3 weeks ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- OpenLLaMA-Chinese, a permissively licensed open source instruction-following models based on OpenLLaMA☆66Updated last year
- ☆310Updated 6 months ago
- 支持中文场景的的小语言模型 llama2.c-zh☆147Updated last year
- Serving Inside Pytorch☆160Updated 2 weeks ago
- Efficient AI Inference & Serving☆471Updated last year
- C++ implementation of Qwen-LM☆595Updated 6 months ago
- Mixture-of-Experts (MoE) Language Model☆189Updated 9 months ago
- Efficient inference of large language models.☆149Updated last week
- Port of Facebook's LLaMA model in C/C++☆97Updated this week
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated last year
- Phi3 中文后训练模型仓库☆321Updated 7 months ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- Explore LLM model deployment based on AXera's AI chips☆107Updated this week
- ☆27Updated 7 months ago
- ggml implementation of the baichuan13b model (adapted from llama.cpp)☆54Updated last year
- Its an open source LLM based on MOE Structure.☆58Updated 11 months ago
- llm-export can export llm model to onnx.☆297Updated 5 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆804Updated 3 weeks ago
- ☆168Updated this week
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- ☆80Updated last year
- ☆133Updated 4 months ago
- ☆349Updated 11 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆73Updated 11 months ago
- ☆90Updated last year
- ☆229Updated last year
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆264Updated last year