wangzhaode / tokenizer.cppLinks
A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.
☆21Updated last month
Alternatives and similar repositories for tokenizer.cpp
Users that are interested in tokenizer.cpp are comparing it to the libraries listed below
Sorting:
- llm deploy project based onnx.☆49Updated last year
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Updated last year
- ☆33Updated last year
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Updated 2 years ago
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆52Updated 2 years ago
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆31Updated 7 months ago
- 使用onnxruntime部署实时视频帧插值,包含C++和Python两个版本的程序☆28Updated last year
- Serving Inside Pytorch☆170Updated 2 weeks ago
- HunyuanDiT with TensorRT and libtorch☆18Updated last year
- mnn asr demo.☆25Updated 10 months ago
- Whisper in TensorRT-LLM☆17Updated 2 years ago
- ☆28Updated 7 months ago
- ncnn HiFi-GAN☆29Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆163Updated 3 months ago
- ☆10Updated last year
- ☆19Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Updated this week
- qwen2 and llama3 cpp implementation☆49Updated last year
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Updated 2 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆42Updated 3 years ago
- Large Language Model Onnx Inference Framework☆36Updated 2 months ago
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5☆16Updated last year
- ☆125Updated 2 years ago
- some ncnn demos of FunASR☆28Updated last year
- Stable Diffusion in TensorRT 8.5+☆15Updated 2 years ago
- an example of segment-anything infer by ncnn☆123Updated 2 years ago
- Run Chinese MobileBert model on SNPE.☆15Updated 2 years ago
- An onnx-based quantitation tool.☆71Updated 2 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Updated last year
- a simple lightweight large language model pipeline framework.☆28Updated 9 months ago