Header-only safetensors loader and saver in C++
☆86Dec 27, 2025Updated 4 months ago
Alternatives and similar repositories for safetensors-cpp
Users that are interested in safetensors-cpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Zero Dependency LibTorch Safetensors Loading and Storing in C++☆23Jul 12, 2024Updated last year
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated 2 years ago
- ☆10Jul 18, 2024Updated last year
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated 2 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- A one-page-only CGraph-API-liked DAG project.☆27Feb 11, 2025Updated last year
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- snpe tutorial☆10Dec 25, 2023Updated 2 years ago
- llm deploy project based onnx.☆49Oct 9, 2024Updated last year
- Dockerfiles for poetry/mlc-llm(rk3588)/...☆10Sep 13, 2023Updated 2 years ago
- ☆42Nov 29, 2022Updated 3 years ago
- Learning to See in the Dark running in Android by ncnn with Raw Camera☆27Feb 20, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- A simple and fast minimalistic header-only library allowing to run async tasks and execute task graphs.☆65Nov 29, 2024Updated last year
- A Triton JIT runtime and ffi provider in C++☆35Updated this week
- 基于 CUDA Driver API 的 cuda 运行时环境☆16Jul 30, 2025Updated 9 months ago
- Speed of Light Analysis for ML Model Runtime☆66Apr 13, 2026Updated last month
- ☆17Jan 1, 2024Updated 2 years ago
- 瑞芯微芯片的rknn推理框架部署(yolo模型)☆14Jul 17, 2025Updated 10 months ago
- nsfw & porn detection for mobile in ncnn☆12Dec 24, 2021Updated 4 years ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆15Oct 24, 2023Updated 2 years ago
- PiDiNet running in Android by ncnn☆15Sep 26, 2021Updated 4 years ago
- Optix7 experiments☆10Oct 20, 2019Updated 6 years ago
- ncnn HiFi-GAN☆30Sep 29, 2024Updated last year
- This repository contains multiple implementations of Flash Attention optimized with Triton kernels, showcasing progressive performance im…☆11Mar 26, 2026Updated last month
- Recording models☆12Sep 19, 2023Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆314Apr 11, 2024Updated 2 years ago
- C++ implementation of Qwen-LM☆625Dec 6, 2024Updated last year
- ☆33Feb 3, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 5 months ago
- run ChatGLM2-6B in BM1684X☆49Mar 1, 2024Updated 2 years ago
- A high-performance implementation of Empirical Dynamic Modeling (EDM)☆20Feb 25, 2026Updated 2 months ago
- Companion repository for the EUSIPCO-24 accepted paper "Pre-Training Music Classification Models via Music Source Separation"☆12Aug 30, 2024Updated last year
- C++数据流并行处理框架☆23Apr 10, 2021Updated 5 years ago
- TTS inference in C++ based on TFlite model☆20Jan 18, 2021Updated 5 years ago
- ☆20Dec 29, 2023Updated 2 years ago