AXERA-TECH / ax-llm
Explore LLM model deployment based on AXera's AI chips
☆69Updated 2 weeks ago
Alternatives and similar repositories for ax-llm:
Users who are interested in ax-llm are comparing it to the libraries listed below
- ☆20Updated last year
- SAM and LaMa inpainting with a Qt GUI: draw points or boxes interactively to run SAM with results displayed in real time, then inpaint. See the video in the README for details.☆44Updated 11 months ago
- Sample code for world-class Artificial Intelligence SoCs for computer vision applications.☆233Updated 2 months ago
- linux bsp app & sample for axpi (ax620a)☆34Updated last year
- ☆37Updated 6 months ago
- LLM deployment project based on ONNX.☆30Updated 3 months ago
- ☆22Updated last year
- DDK for Rockchip NPU☆62Updated 4 years ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆41Updated last year
- ☆24Updated 2 years ago
- ☆10Updated 5 months ago
- run ChatGLM2-6B in BM1684X☆49Updated 10 months ago
- Large Language Model Onnx Inference Framework☆28Updated this week
- A Toolkit to Help Optimize Onnx Model☆101Updated this week
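- An example of segment-anything inference with ncnn☆120Updated last year
- Real-time video inference on network and file stream URLs using the NCNN inference framework and the ByteTrack object tracking framework, with a UI built on Qt☆21Updated 3 months ago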
- ☆124Updated last year
- Run generative AI models in sophgo BM1684X☆152Updated this week
- ppstructure deployment with ncnn☆27Updated 6 months ago
- Converter from MegEngine to other frameworks☆69Updated last year
- ☆21Updated 3 weeks ago
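- CLIP inference implemented in C++. The model is slightly modified; the changes and the model export code can be found in the README. Model files, including those for AX650, are in Releases. Newly added support for ChineseCLIP☆29Updated 3 weeks ago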
- ☆32Updated 5 months ago
- stable diffusion using mnn☆65Updated last year
- OneFlow->ONNX☆42Updated last year
- A secondary wrapper around the rknn2 API that makes it easier to call. Applicable to rk356x and rk3588☆44Updated 2 years ago
- A collection of pre-compiled, state-of-the-art models in AXera's format☆21Updated last year
- A simple forward inference framework made by taking MNN apart (for study!)☆20Updated 3 years ago
- ☆23Updated last year