C++ implementations for various tokenizers (sentencepiece, tiktoken etc).
☆49Updated this week
Alternatives and similar repositories for tokenizers
Users that are interested in tokenizers are comparing it to the libraries listed below
Sorting:
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- ☆23Jan 5, 2026Updated last month
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 3 years ago
- only contain face detect 、5/81 points 、face recognization models☆11Jul 9, 2020Updated 5 years ago
- A gesture recognition module trained from scratch using Pytorch, deployed with ncnn and TensorRT.☆13May 1, 2022Updated 3 years ago
- ☆12Jan 25, 2023Updated 3 years ago
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆31Jun 19, 2025Updated 8 months ago
- Smaller and faster nanochat in MLX☆37Nov 15, 2025Updated 3 months ago
- ☆12Dec 16, 2021Updated 4 years ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated last year
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆15Jun 1, 2023Updated 2 years ago
- ☆16Mar 24, 2025Updated 11 months ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- ☆14Feb 3, 2022Updated 4 years ago
- Simple pytorch classification baselines for MNIST, CIFAR and ImageNet☆19Aug 7, 2019Updated 6 years ago
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- ☆17Jan 1, 2024Updated 2 years ago
- A Triton JIT runtime and ffi provider in C++☆31Updated this week
- The repository supports TensorRT, QNN platform inference, 2D obstacle detection yolo series (yolov5, yolov8, yolo11, yolox), semantic seg…☆20May 6, 2025Updated 9 months ago
- ☆30Jan 9, 2026Updated last month
- Run Chinese MobileBert model on SNPE.☆15May 19, 2023Updated 2 years ago
- ☆42Jun 25, 2020Updated 5 years ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 3 months ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆81May 26, 2025Updated 9 months ago
- segmentation algorithm yolact use tensorrt deploy☆14May 7, 2022Updated 3 years ago
- Keypoints-detection in tensorflow and tensorRT C++☆15Mar 4, 2020Updated 5 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- This is an extension for Visual Studio Code to display OpenCV images while debugging☆18Sep 5, 2023Updated 2 years ago
- Convert ANY IR to ONNX format☆25Feb 12, 2026Updated 2 weeks ago
- For 2022 Nvidia Hackathon☆22Jun 28, 2022Updated 3 years ago
- 使用TensorRT部署SlowFast模型☆24Mar 2, 2022Updated 4 years ago
- 分层解耦的深度学习推理引擎☆79Feb 17, 2025Updated last year
- YOLOv5 Quantization Aware Training with TensorRT☆19Jan 10, 2023Updated 3 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- High Performan Ai Model Web Server. Mainly support computer vision model. Quickly establish your own ai-model server. https://github.com/…☆45May 13, 2025Updated 9 months ago
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆49Oct 5, 2022Updated 3 years ago