herrmann / rustorchLinks
"PyTorch in Rust"
☆16Updated last year
Alternatives and similar repositories for rustorch
Users that are interested in rustorch are comparing it to the libraries listed below
Sorting:
- A collection of optimisers for use with candle☆41Updated last month
- Read and write tensorboard data using Rust☆23Updated last year
- implement llava using candle☆15Updated last year
- ☆39Updated 3 years ago
- Make triton easier☆47Updated last year
- Rust Implementation of micrograd☆53Updated last year
- ☆15Updated last year
- ☆19Updated last year
- See https://github.com/cuda-mode/triton-index/ instead!☆11Updated last year
- ☆23Updated 9 months ago
- ☆133Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆59Updated 3 years ago
- ☆12Updated 8 months ago
- Port of Andrej Karpathy's minbpe to Rust☆29Updated last year
- Utilities for Training Very Large Models☆58Updated last year
- Modular Rust transformer/LLM library using Candle☆37Updated last year
- Because it's there.☆16Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆54Updated 6 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 4 months ago
- Collection of autoregressive model implementation☆86Updated 5 months ago
- Training hybrid models for dummies.☆26Updated last week
- Your one stop CLI for ONNX model analysis.☆47Updated 2 years ago
- ☆28Updated last year
- Supercharge huggingface transformers with model parallelism.☆77Updated 2 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆39Updated 2 years ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated 2 years ago
- Visualising Losses in Deep Neural Networks☆16Updated last year
- ☆21Updated 7 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆102Updated 9 months ago