☆71Feb 27, 2023Updated 3 years ago
Alternatives and similar repositories for huggingface-tokenizer-in-cxx
Users that are interested in huggingface-tokenizer-in-cxx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Universal cross-platform tokenizers binding to HF and sentencepiece☆467Feb 20, 2026Updated last month
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.☆21Jan 5, 2026Updated 2 months ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆18Apr 7, 2022Updated 3 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Port of Funasr's Paraformer model in C/C++☆40Jun 19, 2024Updated last year
- HuggingFace Transformers WordPiece Tokenizer in C++☆21Mar 14, 2025Updated last year
- Source code of our implementation of the concurrent RMA☆12May 23, 2019Updated 6 years ago
- Deploy SQLFlow service mesh on Windows, macOS, and Linux desktop computers☆12Aug 14, 2023Updated 2 years ago
- A four-dimensional Analysis of Partitioned Approximate Filters☆11Aug 6, 2025Updated 7 months ago
- A SQLite extension for working with float and binary vectors. Work in progress!☆24Feb 10, 2023Updated 3 years ago
- 基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用(Implement speech recognition based on Sherpa-ONNX by downloading the model online.)☆28Feb 27, 2025Updated last year
- ☆14May 4, 2017Updated 8 years ago
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆20Aug 3, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆26May 22, 2023Updated 2 years ago
- implement bert in pure c++☆37Apr 29, 2020Updated 5 years ago
- Grizzly: Efficient Stream Processing Through Adaptive Query Compilation☆16Jun 13, 2020Updated 5 years ago
- OneFlow Serving☆20Apr 10, 2025Updated 11 months ago
- A Android client of Stable Diffusion.☆13Mar 29, 2024Updated 2 years ago
- Run Chinese MobileBert model on SNPE.☆15May 19, 2023Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Dec 4, 2023Updated 2 years ago
- 不依赖 Go-Spring 框架的 Web 模块☆14Aug 8, 2020Updated 5 years ago
- Recording models☆12Sep 19, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆16Jan 24, 2025Updated last year
- C++ SDK for Milvus☆54Mar 19, 2026Updated last week
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆16Sep 20, 2024Updated last year
- Another reverse proxy that provides authentication with OpenID Connect☆10Jul 10, 2023Updated 2 years ago
- qwen2 and llama3 cpp implementation☆50Jun 7, 2024Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- ☆151Jan 9, 2025Updated last year
- 単眼深度推定モデルのLite-MonoのPythonでのONNX推論サンプル☆22Apr 12, 2023Updated 2 years ago
- Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"☆16Jul 5, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆34Apr 29, 2019Updated 6 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM 模型搭建及优化☆43Oct 20, 2023Updated 2 years ago
- ☆13Nov 27, 2025Updated 4 months ago
- ☆16Apr 30, 2025Updated 11 months ago
- RDMA Optimization on MXNet☆14Nov 12, 2017Updated 8 years ago
- Linux打包安装最新版 WPS365, 支持一键安装脚本☆22Dec 3, 2024Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated last month