hscspring / bytepiece-rs
The Bytepiece Tokenizer Implemented in Rust.
☆14Updated last year
Alternatives and similar repositories for bytepiece-rs
Users who are interested in bytepiece-rs are comparing it to the libraries listed below.
- ☆20Updated 10 months ago
- RWKV models and examples powered by candle.☆19Updated 6 months ago
- implement llava using candle☆15Updated last year
- A purer, higher-compression-ratio tokenizer in Rust☆13Updated 8 months ago
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆75Updated last month
- A high-performance constrained decoding engine based on context free grammar in Rust☆56Updated 3 months ago
- Port of Andrej Karpathy's minbpe to Rust☆28Updated last year
- Imitate OpenAI with Local Models☆89Updated last year
- ☆24Updated 4 months ago
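- This project performs fast encoding detection and conversion on large numbers of text files to support data cleaning for the mnbvc corpus project☆62Updated 10 months ago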
- Fast LLM training codebase with dynamic strategy selection [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]☆41Updated last year
- A more efficient GLM implementation!☆54Updated 2 years ago
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆80Updated last year
- Longitudinal Evaluation of LLMs via Data Compression☆32Updated last year
- Implementation of the RWKV language model in pure WebGPU/Rust.☆314Updated 2 weeks ago
- Code for KaLM-Embedding models☆91Updated 2 months ago
- Fast serverless LLM inference, in Rust.☆90Updated 6 months ago
- Efficient platform for inference and serving local LLMs including an OpenAI-compatible API server.☆446Updated this week
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆168Updated last week
- Langport is a language model inference service☆94Updated 11 months ago
- Manages vllm-nccl dependency☆17Updated last year
- patches for huggingface transformers to save memory☆27Updated 3 months ago
- A Fish Speech implementation in Rust, with Candle.rs☆95Updated 3 months ago
- Rust standalone inference of Namo-500M series models. Extremely tiny, running a VLM on CPU.☆24Updated 5 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆102Updated 5 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆149Updated last year
- 3x Faster Inference; Unofficial implementation of EAGLE Speculative Decoding☆75Updated 2 months ago
- Rust crate for some audio utilities☆26Updated 5 months ago
- Light local website for displaying performances from different chat models.☆87Updated last year