thecharlieblake/lovely-llama

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thecharlieblake/lovely-llama)

thecharlieblake / lovely-llama

An implementation of the Llama architecture, to instruct and delight

☆21

Alternatives and similar repositories for lovely-llama

Users that are interested in lovely-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

graphcore-research / pytorch-tensor-tracker
View on GitHub
Flexibly track outputs and grad-outputs of torch.nn.Module.
☆13Oct 6, 2023Updated 2 years ago
dlwh / jax_sourceror
View on GitHub
Turn jitted jax functions back into python source code
☆23Dec 16, 2024Updated last year
apoorvkh / torchrunx
View on GitHub
Easily run PyTorch on multiple GPUs & machines
☆60May 2, 2026Updated 2 months ago
sekstini / basedxl
View on GitHub
☆18Mar 18, 2024Updated 2 years ago
caillonantoine / NIME_workshop
View on GitHub
☆14Sep 21, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
er537 / whisper_interpretability
View on GitHub
A repo to do interpretability of pre-trained acoustic models
☆15Oct 15, 2023Updated 2 years ago
yixiaoer / mistral-v0.2-jax
View on GitHub
JAX implementation of the Mistral 7b v0.2 model
☆35Jul 3, 2024Updated 2 years ago
graphcore-research / jax-experimental
View on GitHub
JAX for Graphcore IPU (experimental)
☆21Mar 12, 2024Updated 2 years ago
MatX-inc / seqax
View on GitHub
seqax = sequence modeling + JAX
☆195Jul 23, 2025Updated last year
cloneofsimo / efae
View on GitHub
☆24Jun 18, 2024Updated 2 years ago
JonasGeiping / linear_cross_entropy_loss
View on GitHub
A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.
☆75Aug 2, 2024Updated last year
kvfrans / jax-fid-parallel
View on GitHub
Frechet inception distance (FID) evaluation in JAX
☆14May 28, 2024Updated 2 years ago
nestordemeure / jochastic
View on GitHub
A JAX implementation of stochastic addition.
☆13Aug 15, 2022Updated 3 years ago
wesg52 / universal-neurons
View on GitHub
Universal Neurons in GPT2 Language Models
☆30May 28, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jax-ml / australis
View on GitHub
☆28Nov 18, 2022Updated 3 years ago
young-geng / mintext
View on GitHub
Minimal but scalable implementation of large language models in JAX
☆34Nov 28, 2025Updated 8 months ago
google / drjax
View on GitHub
☆19Jul 8, 2026Updated 3 weeks ago
1iyiwei / research-templates
View on GitHub
LaTeX templates I created for authoring research papers
☆16Mar 19, 2019Updated 7 years ago
ethansmith2000 / TransformerExperiments
View on GitHub
☆19Dec 4, 2025Updated 7 months ago
yixiaoer / einshard
View on GitHub
Einsum-like high-level array sharding API for JAX
☆35Jul 16, 2024Updated 2 years ago
berlino / seq_icl
View on GitHub
☆54May 20, 2024Updated 2 years ago
davisrbr / conjectures-arxiv
View on GitHub
OpenConjecture, a dataset of mathematics conjectures pulled from papers published to the ArXiv
☆16Jul 12, 2026Updated 2 weeks ago
lucas-maes / nano-simsiam
View on GitHub
Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis…
☆22Nov 25, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
huggingface / candle-paged-attention
View on GitHub
☆12Jan 4, 2024Updated 2 years ago
young-geng / tpu_pod_commander
View on GitHub
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
☆20Sep 24, 2025Updated 10 months ago
mrtzh / Ladder.jl
View on GitHub
A reliable leaderboard algorithm for machine learning competitions
☆17May 19, 2015Updated 11 years ago
EleutherAI / nanoGPT-mup
View on GitHub
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆200Jan 19, 2026Updated 6 months ago
nshepperd / flash_attn_jax
View on GitHub
JAX bindings for Flash Attention v2
☆108Feb 28, 2026Updated 5 months ago
google-deepmind / asyncdiloco
View on GitHub
☆51Jan 18, 2024Updated 2 years ago
BrachioLab / Meerkat
View on GitHub
An agent for auditing repositories of traces for violations of safety properties. Automatically finds cheating (task-level gaming and har…
☆15Jun 6, 2026Updated last month
hearbenchmark / hear-baseline
View on GitHub
Simple baseline model for the HEAR benchmark
☆23Feb 17, 2026Updated 5 months ago
joey00072 / microjax
View on GitHub
Jax like function transformation engine but micro, microjax
☆34Oct 25, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
KhoomeiK / complexity-scaling
View on GitHub
gzip Predicts Data-dependent Scaling Laws
☆35May 28, 2024Updated 2 years ago
aj-talaei / GPU_Programming_Specialization
View on GitHub
This repository contains my coursework and projects completed during the GPU Programming Specialization offered by Johns Hopkins Universi…
☆11Jun 13, 2023Updated 3 years ago
Jokeren / triton-samples
View on GitHub
☆29Jan 17, 2025Updated last year
victusfate / concierge
View on GitHub
real time recommendation playground
☆15Nov 7, 2022Updated 3 years ago
aredden / torch-cublas-hgemm
View on GitHub
PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu
☆78Dec 3, 2024Updated last year
RobertCsordas / moeut
View on GitHub
☆93Aug 18, 2024Updated last year
AllanYangZhou / midGPT
View on GitHub
Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.
☆27Sep 29, 2024Updated last year