saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contain the simple llama3 implementation in pure jax.
☆63Updated 2 months ago
Alternatives and similar repositories for Llama-3-From-Scratch-In-Pure-Jax:
Users that are interested in Llama-3-From-Scratch-In-Pure-Jax are comparing it to the libraries listed below
- NanoGPT-speedrunning for the poor T4 enjoyers☆63Updated 2 weeks ago
- Simple Transformer in Jax☆136Updated 10 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆73Updated this week
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated last month
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated last month
- prime-rl is a codebase for decentralized RL training at scale☆85Updated this week
- look how they massacred my boy☆63Updated 6 months ago
- An introduction to LLM Sampling☆77Updated 4 months ago
- ☆27Updated 9 months ago
- ☆38Updated 9 months ago
- Learning about CUDA by writing PTX code.☆128Updated last year
- A package for defining deep learning models using categorical algebraic expressions.☆60Updated 9 months ago
- ☆45Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated 2 months ago
- Collection of autoregressive model implementation☆85Updated last week
- A really tiny autograd engine☆92Updated last year
- Jax like function transformation engine but micro, microjax☆31Updated 6 months ago
- Andrej Kapathy's micrograd implemented in c☆28Updated 9 months ago
- making the official triton tutorials actually comprehensible☆27Updated last month
- NanoGPT (124M) quality in 2.67B tokens☆28Updated 2 weeks ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆174Updated 9 months ago
- The history files when recording human interaction while solving ARC tasks☆108Updated last week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆180Updated last week
- PageRank for LLMs☆41Updated 3 weeks ago
- ☆105Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 6 months ago
- ☆177Updated this week
- Simple repository for training small reasoning models☆27Updated 3 months ago
- Our solution for the arc challenge 2024☆135Updated 2 months ago
- Simple GRPO scripts and configurations.☆58Updated 3 months ago