☆72Feb 27, 2023Updated 3 years ago
Alternatives and similar repositories for huggingface-tokenizer-in-cxx
Users who are interested in huggingface-tokenizer-in-cxx are comparing it to the libraries listed below.
- Universal cross-platform tokenizers binding to HF and sentencepiece☆474Feb 20, 2026Updated last month
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.☆21Jan 5, 2026Updated 3 months ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆18Apr 7, 2022Updated 4 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- Port of Funasr's Paraformer model in C/C++☆42Jun 19, 2024Updated last year
- HuggingFace Transformers WordPiece Tokenizer in C++☆21Mar 14, 2025Updated last year
- Minimal example of using a traced huggingface transformers model with libtorch☆35Sep 17, 2020Updated 5 years ago
- Source code of our implementation of the concurrent RMA☆12May 23, 2019Updated 6 years ago
- Concurrent (with OLC) Adaptive Radix Trie in Golang.☆11Jul 31, 2020Updated 5 years ago
- Deploy SQLFlow service mesh on Windows, macOS, and Linux desktop computers☆12Aug 14, 2023Updated 2 years ago
- A four-dimensional Analysis of Partitioned Approximate Filters☆11Aug 6, 2025Updated 8 months ago
- Tutorials on extending and importing TVM as a CMake include dependency.☆16Oct 11, 2024Updated last year
- An on-device real-time speech recognition application built on Sherpa-ONNX that downloads its models online.☆28Feb 27, 2025Updated last year
- A SQLite extension for working with float and binary vectors. Work in progress!☆24Feb 10, 2023Updated 3 years ago
- ☆14May 4, 2017Updated 8 years ago
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆20Aug 3, 2025Updated 8 months ago
- BERT Tokenizer in C++☆79Jan 14, 2021Updated 5 years ago
- Grizzly: Efficient Stream Processing Through Adaptive Query Compilation☆16Jun 13, 2020Updated 5 years ago
- An Android client for Stable Diffusion.☆13Mar 29, 2024Updated 2 years ago
- Fast and customizable text tokenization library with BPE and SentencePiece support☆332Jan 10, 2026Updated 3 months ago
- ☆16Jan 24, 2025Updated last year
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆16Sep 20, 2024Updated last year
- qwen2 and llama3 cpp implementation☆50Jun 7, 2024Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- ☆150Jan 9, 2025Updated last year
- ☆18Dec 7, 2023Updated 2 years ago
- Python ONNX inference sample for Lite-Mono, a monocular depth estimation model☆22Apr 12, 2023Updated 3 years ago
- Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"☆16Jul 5, 2023Updated 2 years ago
- Fast Cardinality Estimation of Multi-Join Queries Using Sketches☆16Feb 29, 2024Updated 2 years ago
- ☆34Apr 29, 2019Updated 6 years ago
- ☆13Nov 27, 2025Updated 4 months ago
- ☆17Apr 30, 2025Updated 11 months ago
- PDF screenshot generator for web pages☆28Oct 25, 2024Updated last year
- TTG: Template Task Graph C++ API☆26Apr 11, 2026Updated last week
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Jul 7, 2023Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated 2 months ago
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆39Jul 14, 2025Updated 9 months ago
- Parallel Wavelet Tree and Wavelet Matrix Construction☆25Jun 27, 2023Updated 2 years ago