This repository contain the simple llama3 implementation in pure jax.
☆72Feb 17, 2025Updated last year
Alternatives and similar repositories for Llama-3-From-Scratch-In-Pure-Jax
Users that are interested in Llama-3-From-Scratch-In-Pure-Jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.☆26Feb 20, 2025Updated last year
- Fixed-point scalar and matrix multiplication library for SectorLISP☆15Jan 23, 2022Updated 4 years ago
- Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…☆38Oct 24, 2025Updated 7 months ago
- working implimention of deepseek MLA☆44Jan 8, 2025Updated last year
- Xtructure is datastructure for using in JAX☆22May 18, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for NeurIPS 2024 paper "A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language Models"☆15Oct 17, 2024Updated last year
- Fast reinforcement learning 💨☆29Jul 15, 2025Updated 10 months ago
- ☆13Jun 3, 2024Updated last year
- learningggggggg 🐳☆619Apr 2, 2025Updated last year
- Inference code for LLaMA models in JAX☆120May 21, 2024Updated 2 years ago
- ☆29Dec 15, 2025Updated 5 months ago
- This repository contains examples for RxInfer.jl☆27May 18, 2026Updated last week
- Implementation of Denoising Diffusion Probabilistic Models (DDPM) in JAX and Flax.☆22Oct 12, 2023Updated 2 years ago
- Crestle version of fast.ai courses☆14Nov 22, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- moondream in zig.☆76May 30, 2025Updated 11 months ago
- [ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models☆39Nov 4, 2025Updated 6 months ago
- ☆355Apr 13, 2026Updated last month
- ☆25Dec 5, 2025Updated 5 months ago
- Implementations of Papers that I read, you can read my breakdown in my blog☆92Oct 23, 2025Updated 7 months ago
- A toolkit for building multimodal, streaming APIs and UIs☆81Mar 2, 2026Updated 2 months ago
- Leo optimizer, variation of Muon that runs faster☆59Sep 6, 2025Updated 8 months ago
- Teaching materials for improving research software writing abilities.☆14Apr 16, 2026Updated last month
- ☆18Dec 2, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Jan 24, 2025Updated last year
- Simple repository for training small reasoning models☆52Feb 17, 2026Updated 3 months ago
- ☆35May 15, 2026Updated 2 weeks ago
- JAxtar is a project with a JAX-native implementation of parallelizeable A* & Q* solver for neural heuristic search research.☆50May 18, 2026Updated last week
- ☆14Feb 9, 2026Updated 3 months ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆25May 20, 2024Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆19Nov 1, 2023Updated 2 years ago
- A library for training crosscoders☆17May 28, 2025Updated last year
- ☆27Mar 6, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Sparse Fourier Backpropagation in Cryo-EM Reconstruction☆12Dec 3, 2023Updated 2 years ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- Create cohorts from databases utilizing the OMOP CDM☆10May 19, 2025Updated last year
- Basic world models☆32Oct 30, 2025Updated 6 months ago
- ☆21Jun 26, 2023Updated 2 years ago
- Codebase for training the SubCell models☆20Updated this week
- ☆24May 22, 2026Updated last week