C++ implementations for various tokenizers (sentencepiece, tiktoken etc).
☆49Apr 21, 2026Updated last week
Alternatives and similar repositories for tokenizers
Users that are interested in tokenizers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- ☆41Mar 18, 2026Updated last month
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- GPT-Sovits的c++实现版本☆22Jan 9, 2026Updated 3 months ago
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- C++ version of ailia models repository☆25Dec 31, 2025Updated 4 months ago
- 2020中兴捧月阿尔法赛道多目标检测和跟踪初赛第一名方案☆33Oct 3, 2023Updated 2 years ago
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆15Jun 1, 2023Updated 2 years ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- ☆12Jan 25, 2023Updated 3 years ago
- Run Chinese MobileBert model on SNPE.☆15May 19, 2023Updated 2 years ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- ☆12Dec 16, 2021Updated 4 years ago
- A gesture recognition module trained from scratch using Pytorch, deployed with ncnn and TensorRT.☆13May 1, 2022Updated 4 years ago
- ☆16Mar 24, 2025Updated last year
- ☆42Jun 25, 2020Updated 5 years ago
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆31Jun 19, 2025Updated 10 months ago
- ☆14Feb 3, 2022Updated 4 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- INTERVAL field for PostgreSQL (and an approximation for other backends)☆21Jul 27, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The repository supports TensorRT, QNN platform inference, 2D obstacle detection yolo series (yolov5, yolov8, yolo11, yolox), semantic seg…☆20May 6, 2025Updated 11 months ago
- ☆34Jan 9, 2026Updated 3 months ago
- A Triton JIT runtime and ffi provider in C++☆33Updated this week
- handle gguf files☆13Aug 14, 2025Updated 8 months ago
- only contain face detect 、5/81 points 、face recognization models☆11Jul 9, 2020Updated 5 years ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 5 months ago
- segmentation algorithm yolact use tensorrt deploy☆14May 7, 2022Updated 3 years ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆84May 26, 2025Updated 11 months ago
- Example apps and demos using PyTorch's ExecuTorch framework☆75Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Community bot☆12Feb 25, 2023Updated 3 years ago
- ☆17Jan 1, 2024Updated 2 years ago
- 车道线检测Lanenet TensorRT加速C++实现☆23Feb 24, 2022Updated 4 years ago
- ☆15Nov 14, 2023Updated 2 years ago
- High-speed and easy-use LLM serving framework for local deployment☆149Aug 7, 2025Updated 8 months ago
- For 2022 Nvidia Hackathon☆22Jun 28, 2022Updated 3 years ago
- ☆25Sep 19, 2025Updated 7 months ago