hscspring / bytepiece-rsLinks
The Bytepiece Tokenizer Implemented in Rust.
☆14Updated last year
Alternatives and similar repositories for bytepiece-rs
Users that are interested in bytepiece-rs are comparing it to the libraries listed below
Sorting:
- ☆20Updated 10 months ago
- implement llava using candle☆15Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- Manages vllm-nccl dependency☆17Updated last year
- RWKV models and examples powered by candle.☆19Updated 5 months ago
- Port of Andrej Karpathy's minbpe to Rust☆25Updated last year
- Code for KaLM-Embedding models☆89Updated last month
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- Fast serverless LLM inference, in Rust.☆88Updated 5 months ago
- GPU based FFT written in Rust and CubeCL☆23Updated 2 months ago
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆73Updated last month
- Implementation of the RWKV language model in pure WebGPU/Rust.☆314Updated last month
- Longitudinal Evaluation of LLMs via Data Compression☆32Updated last year
- Imitate OpenAI with Local Models☆88Updated 11 months ago
- Langport is a language model inference service☆94Updated 11 months ago
- RWKV in nanoGPT style☆191Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆80Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆148Updated last year
- ☆24Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated last year
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Updated last year
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆97Updated last year
- A more efficient GLM implementation!☆55Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆67Updated 2 years ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆46Updated 8 months ago
- Rust crate for some audio utilities☆26Updated 5 months ago
- Locality Sensitive Hashing☆72Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust☆54Updated 2 months ago