An implementation of the Llama architecture, to instruct and delight
☆21May 31, 2025Updated 9 months ago
Alternatives and similar repositories for lovely-llama
Users that are interested in lovely-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Oct 6, 2023Updated 2 years ago
- ☆18Mar 18, 2024Updated 2 years ago
- Fast, simple, cryptographically strong random numbers in C++. Experimental.☆19Dec 12, 2013Updated 12 years ago
- A repo to do interpretability of pre-trained acoustic models☆15Oct 15, 2023Updated 2 years ago
- ☆23Jun 18, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Aug 2, 2024Updated last year
- Frechet inception distance (FID) evaluation in JAX☆14May 28, 2024Updated last year
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- ☆16Feb 24, 2026Updated last month
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis…☆21Nov 25, 2024Updated last year
- JAX for Graphcore IPU (experimental)☆22Mar 12, 2024Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆16Oct 20, 2025Updated 5 months ago
- ☆19Feb 25, 2026Updated last month
- LaTeX templates I created for authoring research papers☆16Mar 19, 2019Updated 7 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Sep 24, 2025Updated 6 months ago
- A reliable leaderboard algorithm for machine learning competitions☆17May 19, 2015Updated 10 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆189Jan 19, 2026Updated 2 months ago
- A toolkit for scaling law research ⚖☆59Jan 27, 2025Updated last year
- JAX bindings for Flash Attention v2☆104Feb 28, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- SoundSeek is a new interface to search through your sound file libraries☆22Apr 10, 2018Updated 7 years ago
- ☆49Jan 18, 2024Updated 2 years ago
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated last month
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- A modular system for machinable research code☆35Apr 12, 2025Updated 11 months ago
- gzip Predicts Data-dependent Scaling Laws☆34May 28, 2024Updated last year
- ML Benchmarks in Algebraic Combinatorics☆25Jan 15, 2026Updated 2 months ago
- ☆28Jan 17, 2025Updated last year
- music semantic understanding evaluation benchmark☆25Aug 12, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- An experimentation platform for LLM inference optimisation☆36Sep 19, 2024Updated last year
- ☆91Aug 18, 2024Updated last year
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations☆16Apr 15, 2024Updated last year
- ☆306Jul 15, 2024Updated last year
- ☆44Jun 19, 2024Updated last year