☆72Feb 27, 2023Updated 3 years ago
Alternatives and similar repositories for huggingface-tokenizer-in-cxx
Users that are interested in huggingface-tokenizer-in-cxx are comparing it to the libraries listed below.
- Universal cross-platform tokenizers binding to HF and sentencepiece☆496May 20, 2026Updated last month
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- GPT2 implementation in C++ using Ort☆26Jan 28, 2021Updated 5 years ago
- Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.☆23Jan 5, 2026Updated 5 months ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆18Apr 7, 2022Updated 4 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀☆25Sep 13, 2023Updated 2 years ago
- HuggingFace Transformers WordPiece Tokenizer in C++☆21Mar 14, 2025Updated last year
- Minimal example of using a traced huggingface transformers model with libtorch☆35Sep 17, 2020Updated 5 years ago
- Concurrent (with OLC) Adaptive Radix Trie in Golang.☆12Jul 31, 2020Updated 5 years ago
- A four-dimensional Analysis of Partitioned Approximate Filters☆11Aug 6, 2025Updated 10 months ago
- An on-device real-time speech recognition app based on Sherpa-ONNX that downloads its models online.☆29Feb 27, 2025Updated last year
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆20Aug 3, 2025Updated 10 months ago
- BERT Tokenizer in C++☆79Jan 14, 2021Updated 5 years ago
- implement bert in pure c++☆37Apr 29, 2020Updated 6 years ago
- ☆26May 22, 2023Updated 3 years ago
- Grizzly: Efficient Stream Processing Through Adaptive Query Compilation☆16Jun 13, 2020Updated 6 years ago
- OneFlow Serving☆20Apr 10, 2025Updated last year
- Run Chinese MobileBert model on SNPE.☆15May 19, 2023Updated 3 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Dec 4, 2023Updated 2 years ago
- Recording models☆12Sep 19, 2023Updated 2 years ago
- Fast and customizable text tokenization library with BPE and SentencePiece support☆334Jan 10, 2026Updated 5 months ago
- C++ SDK for Milvus☆54Updated this week
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆16Sep 20, 2024Updated last year
- qwen2 and llama3 cpp implementation☆50Jun 7, 2024Updated 2 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- ☆150Jan 9, 2025Updated last year
- ☆18Dec 7, 2023Updated 2 years ago
- Python ONNX inference sample for Lite-Mono, a monocular depth estimation model☆23Apr 12, 2023Updated 3 years ago
- ☆16Mar 16, 2021Updated 5 years ago
- Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"☆16Jul 5, 2023Updated 2 years ago
- Fast Cardinality Estimation of Multi-Join Queries Using Sketches☆16Feb 29, 2024Updated 2 years ago
- ☆34Apr 29, 2019Updated 7 years ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆43Oct 20, 2023Updated 2 years ago
- ☆13Nov 27, 2025Updated 7 months ago
- Live-streaming broadcaster functionality over RTMP: captures video and audio data and pushes the stream to an RTMP server.☆12Apr 7, 2017Updated 9 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Jul 7, 2023Updated 2 years ago
- Sequence algorithms for use in Flashlight.☆14Jan 12, 2026Updated 5 months ago
- Package and install the latest WPS365 on Linux, with a one-click install script☆22Dec 3, 2024Updated last year