pytorch-labs / tokenizersLinks
C++ implementations for various tokenizers (sentencepiece, tiktoken etc).
☆31Updated this week
Alternatives and similar repositories for tokenizers
Users that are interested in tokenizers are comparing it to the libraries listed below
Sorting:
- Whisper in TensorRT-LLM☆16Updated last year
- Yet another Polyhedra Compiler for DeepLearning☆19Updated 2 years ago
- ONNX Command-Line Toolbox☆35Updated 8 months ago
- ☆11Updated last year
- ☆42Updated 5 years ago
- Wanwu models release, code will be released soon☆24Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆23Updated last year
- Tengine 管子是用来快速生产 demo 的辅助工具☆13Updated 3 years ago
- ☆13Updated 2 years ago
- ☆24Updated 2 years ago
- TVMScript kernel for deformable attention☆25Updated 3 years ago
- OneFlow->ONNX☆43Updated 2 years ago
- Awesome code, projects, books, etc. related to CUDA☆17Updated last week
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- Model compression for ONNX☆96Updated 7 months ago
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- ☆29Updated 4 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆72Updated last week
- ☆69Updated 2 years ago
- Describing How to Enable OpenVINO Execution Provider for ONNX Runtime☆20Updated 4 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆28Updated 5 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated last year
- ☆18Updated 2 weeks ago
- ☆24Updated 3 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- ☆27Updated last week
- Arch-Net: Model Distillation for Architecture Agnostic Model Deployment☆22Updated 3 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆110Updated 9 months ago
- quantize aware training package for NCNN on pytorch☆70Updated 3 years ago