poudels14 / llama2_rs_oldLinks
☆23Updated 2 years ago
Alternatives and similar repositories for llama2_rs_old
Users that are interested in llama2_rs_old are comparing it to the libraries listed below
Sorting:
- ☆130Updated last year
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆38Updated 2 years ago
- ☆137Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆100Updated 5 months ago
- Rust Implementation of micrograd☆52Updated last year
- implement llava using candle☆15Updated last year
- Inference Llama 2 in one file of pure Rust 🦀☆234Updated last year
- ☆156Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆80Updated last year
- Simplex Random Feature attention, in PyTorch☆74Updated last year
- ☆39Updated 2 years ago
- Read and write tensorboard data using Rust☆21Updated last year
- gzip Predicts Data-dependent Scaling Laws☆35Updated last year
- ☆27Updated last year
- A sketch of a Transformer in Rust for a blog post☆32Updated 3 years ago
- "PyTorch in Rust"☆16Updated last year
- 👷 Build compute kernels☆87Updated last week
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- ☆61Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆99Updated 3 weeks ago
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated last year
- Make triton easier☆47Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆111Updated last year
- LLaMA from First Principles☆51Updated 2 years ago
- ☆26Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- Dataflow is a data processing library, primarily for machine learning.☆24Updated 2 years ago
- Fast serverless LLM inference, in Rust.☆88Updated 5 months ago