poudels14 / llama2_rs_old
☆23 · Updated last year
Alternatives and similar repositories for llama2_rs_old:
Users who are interested in llama2_rs_old are comparing it to the libraries listed below.
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ☆37 · Updated last year
- ☆126 · Updated 11 months ago
- ☆60 · Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback ☆75 · Updated 3 weeks ago
- A sketch of a Transformer in Rust for a blog post ☆30 · Updated 3 years ago
- "PyTorch in Rust" ☆16 · Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust ☆79 · Updated last year
- Rust Implementation of micrograd ☆51 · Updated 9 months ago
- Inference Llama 2 in one file of pure Rust 🦀 ☆233 · Updated last year
- ☆37 · Updated 2 years ago
- Read and write tensorboard data using Rust ☆20 · Updated last year
- Simplex Random Feature attention, in PyTorch ☆74 · Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated last year
- inference code for mixtral-8x7b-32kseqlen ☆99 · Updated last year
- ☆28 · Updated 4 months ago
- ☆137 · Updated last year
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed! ☆104 · Updated last year
- ☆25 · Updated 3 months ago
- Rust port of llm.c by @karpathy ☆41 · Updated 11 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆19 · Updated 2 weeks ago
- Full finetuning of large language models without large memory requirements ☆93 · Updated last year
- A comprehensive Rust translation of the code from Sebastian Raschka's Build an LLM from Scratch book. ☆170 · Updated this week
- Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7) ☆24 · Updated 2 years ago
- Make triton easier ☆47 · Updated 9 months ago
- gzip Predicts Data-dependent Scaling Laws ☆34 · Updated 10 months ago
- A collection of optimisers for use with candle ☆34 · Updated 4 months ago
- ☆18 · Updated 2 years ago
- A high-performance constrained decoding engine based on context free grammar in Rust ☆48 · Updated 3 months ago
- ☆57 · Updated last year
- ☆27 · Updated 8 months ago