An implementation of the Llama architecture, to instruct and delight
☆21May 31, 2025Updated 11 months ago
Alternatives and similar repositories for lovely-llama
Users that are interested in lovely-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Oct 6, 2023Updated 2 years ago
- Easily run PyTorch on multiple GPUs & machines☆60May 2, 2026Updated 3 weeks ago
- ☆18Mar 18, 2024Updated 2 years ago
- ☆14Sep 21, 2022Updated 3 years ago
- Fast, simple, cryptographically strong random numbers in C++. Experimental.☆19Dec 12, 2013Updated 12 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- seqax = sequence modeling + JAX☆191Jul 23, 2025Updated 10 months ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 8 months ago
- ☆24Jun 18, 2024Updated last year
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Aug 2, 2024Updated last year
- Frechet inception distance (FID) evaluation in JAX☆14May 28, 2024Updated 2 years ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- ☆20Apr 24, 2026Updated last month
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis…☆22Nov 25, 2024Updated last year
- Minimal but scalable implementation of large language models in JAX☆34Nov 28, 2025Updated 6 months ago
- ☆28Nov 18, 2022Updated 3 years ago
- ☆19May 16, 2026Updated 2 weeks ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- LaTeX templates I created for authoring research papers☆16Mar 19, 2019Updated 7 years ago
- ☆19Dec 4, 2025Updated 5 months ago
- ☆12Jan 4, 2024Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Sep 24, 2025Updated 8 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A toolkit for scaling law research ⚖☆63Jan 27, 2025Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆192Jan 19, 2026Updated 4 months ago
- JAX bindings for Flash Attention v2☆106Feb 28, 2026Updated 3 months ago
- ☆50Jan 18, 2024Updated 2 years ago
- gzip Predicts Data-dependent Scaling Laws☆35May 28, 2024Updated 2 years ago
- ML Benchmarks in Algebraic Combinatorics☆24Jan 15, 2026Updated 4 months ago
- ☆29Jan 17, 2025Updated last year
- real time recommendation playground☆15Nov 7, 2022Updated 3 years ago
- Trace instruction execution using perf breakpoints in Python☆24Dec 3, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- An experimentation platform for LLM inference optimisation☆36Sep 19, 2024Updated last year
- [NeurIPS 2022] Supervising the Multi-Fidelity Race of Hyperparameter Configurations☆13Apr 25, 2023Updated 3 years ago
- ☆93Aug 18, 2024Updated last year
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆78Dec 3, 2024Updated last year
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- ☆45Jun 19, 2024Updated last year