An implementation of the Llama architecture, to instruct and delight
☆21May 31, 2025Updated 11 months ago
Alternatives and similar repositories for lovely-llama
Users that are interested in lovely-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Oct 6, 2023Updated 2 years ago
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- Easily run PyTorch on multiple GPUs & machines☆60May 2, 2026Updated last week
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- ☆14Sep 21, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A repo to do interpretability of pre-trained acoustic models☆15Oct 15, 2023Updated 2 years ago
- seqax = sequence modeling + JAX☆189Jul 23, 2025Updated 9 months ago
- JAX implementation of the Mistral 7b v0.2 model☆35Jul 3, 2024Updated last year
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 8 months ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Aug 2, 2024Updated last year
- Frechet inception distance (FID) evaluation in JAX☆14May 28, 2024Updated last year
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis…☆22Nov 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- JAX for Graphcore IPU (experimental)☆22Mar 12, 2024Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 5 months ago
- ☆28Nov 18, 2022Updated 3 years ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- ☆54May 20, 2024Updated last year
- ☆12Jan 4, 2024Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Sep 24, 2025Updated 7 months ago
- A reliable leaderboard algorithm for machine learning competitions☆17May 19, 2015Updated 10 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆192Jan 19, 2026Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆47Feb 26, 2026Updated 2 months ago
- JAX bindings for Flash Attention v2☆105Feb 28, 2026Updated 2 months ago
- ☆14Mar 31, 2024Updated 2 years ago
- Scalable Automatic Visual Summarization of Concepts in Deep Neural Networks☆18May 11, 2022Updated 3 years ago
- SoundSeek is a new interface to search through your sound file libraries☆22Apr 10, 2018Updated 8 years ago
- ☆50Jan 18, 2024Updated 2 years ago
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated 2 months ago
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- A modular system for machinable research code☆35Apr 29, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- gzip Predicts Data-dependent Scaling Laws☆35May 28, 2024Updated last year
- ☆28Jan 17, 2025Updated last year
- music semantic understanding evaluation benchmark☆25Aug 12, 2023Updated 2 years ago
- ☆92Aug 18, 2024Updated last year
- [NeurIPS 2022] Supervising the Multi-Fidelity Race of Hyperparameter Configurations☆13Apr 25, 2023Updated 3 years ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆78Dec 3, 2024Updated last year
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations☆16Apr 15, 2024Updated 2 years ago