An implementation of the Llama architecture, to instruct and delight
☆21May 31, 2025Updated 10 months ago
Alternatives and similar repositories for lovely-llama
Users that are interested in lovely-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Oct 6, 2023Updated 2 years ago
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- Easily run PyTorch on multiple GPUs & machines☆60Jan 8, 2026Updated 3 months ago
- ☆18Mar 18, 2024Updated 2 years ago
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fast, simple, cryptographically strong random numbers in C++. Experimental.☆19Dec 12, 2013Updated 12 years ago
- A repo to do interpretability of pre-trained acoustic models☆15Oct 15, 2023Updated 2 years ago
- seqax = sequence modeling + JAX☆188Jul 23, 2025Updated 8 months ago
- JAX implementation of the Mistral 7b v0.2 model☆35Jul 3, 2024Updated last year
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 7 months ago
- ☆24Jun 18, 2024Updated last year
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Aug 2, 2024Updated last year
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- ☆18Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 4 months ago
- ☆28Nov 18, 2022Updated 3 years ago
- ☆17Apr 3, 2026Updated 2 weeks ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Comp 781 Project☆10Jan 2, 2026Updated 3 months ago
- ☆19Feb 25, 2026Updated last month
- LaTeX templates I created for authoring research papers☆16Mar 19, 2019Updated 7 years ago
- ☆54May 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Hex encode & decode a string, right from your terminal.☆10Jan 5, 2023Updated 3 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆190Jan 19, 2026Updated 3 months ago
- ☆47Feb 26, 2026Updated last month
- JAX bindings for Flash Attention v2☆104Feb 28, 2026Updated last month
- ☆14Mar 31, 2024Updated 2 years ago
- Scalable Automatic Visual Summarization of Concepts in Deep Neural Networks☆18May 11, 2022Updated 3 years ago
- ☆49Jan 18, 2024Updated 2 years ago
- A modular system for machinable research code☆35Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- gzip Predicts Data-dependent Scaling Laws☆35May 28, 2024Updated last year
- ☆28Jan 17, 2025Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- ☆91Aug 18, 2024Updated last year
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆78Dec 3, 2024Updated last year
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- ☆45Jun 19, 2024Updated last year