Explore LLM model deployment based on AXera's AI chips
☆151May 6, 2026Updated this week
Alternatives and similar repositories for ax-llm
Users that are interested in ax-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Samples code for world class Artificial Intelligence SoCs for computer vision applications.☆290Apr 29, 2026Updated last week
- The docs repository of Pulsar2 which is AXera's SoC 2rd AI toolchain. Such as AX650A, AX650N☆17Apr 23, 2026Updated 2 weeks ago
- ☆28Jun 30, 2025Updated 10 months ago
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆15Sep 3, 2025Updated 8 months ago
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 8 months ago
- the python api for axengine runtime☆26Mar 24, 2026Updated last month
- linux bsp app & sample for axpi (ax620a)☆36Jun 21, 2023Updated 2 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- OpenAI Whisper demo on Axera☆15Jan 15, 2026Updated 3 months ago
- Linux BSP APP & Samples for AXera Pi Zero(AX620Q)☆22Nov 1, 2024Updated last year
- M5Stack dropdown menu code sample☆15Feb 5, 2019Updated 7 years ago
- ☆69Apr 10, 2026Updated 3 weeks ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16Aug 15, 2024Updated last year
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆20Aug 3, 2025Updated 9 months ago
- A SDK to using the Realtime API with Microcontrollers like the ESP32☆23Dec 24, 2024Updated last year
- MegEngine到其他框架的转换器☆71Apr 27, 2023Updated 3 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆31Jun 19, 2025Updated 10 months ago
- ArduPlane, ArduCopter, ArduRover source☆10Updated this week
- ☆16Nov 6, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- stable diffusion using mnn☆68Sep 28, 2023Updated 2 years ago
- Axera download protocol (AXDL) implementation in Rust (Unofficial)☆18Apr 4, 2025Updated last year
- 3D rendering by M5Stack☆26Apr 8, 2019Updated 7 years ago
- llm-export can export llm model to onnx.☆350Oct 24, 2025Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- The Pipeline example based on AX650N/AX8850 shows the software development skills of Image Processing, NPU, Codec, and Display modules, …☆16Apr 27, 2026Updated last week
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- ☆1,432Mar 12, 2026Updated last month
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆28Jun 27, 2023Updated 2 years ago
- linux bsp app & sample for axpi pro (ax650n)☆31Nov 12, 2024Updated last year
- Arduino library for M5Stack LLM Module☆34Oct 29, 2025Updated 6 months ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆482Oct 23, 2024Updated last year
- ncnn和pnnx格式编辑器☆147Apr 21, 2026Updated 2 weeks ago
- a single-header math library☆17Nov 7, 2025Updated 6 months ago
- ☆24Jul 31, 2025Updated 9 months ago