mzguntalan / zephyr
Zephyr is a declarative neural network library on top of JAX allowing for easy and fast neural network designing, creation, and manipulation
☆35Updated 3 months ago
Alternatives and similar repositories for zephyr:
Users that are interested in zephyr are comparing it to the libraries listed below
- ☆27Updated 8 months ago
- A pure and fast NumPy implementation of Mamba with cache support.☆17Updated 9 months ago
- 🧱 Modula software package☆187Updated this week
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆76Updated 3 weeks ago
- ☆42Updated last week
- train entropix like a champ!☆20Updated 5 months ago
- Schedule free optimiser implemented in JAX using Optimistix☆14Updated 10 months ago
- look how they massacred my boy☆63Updated 5 months ago
- Clean RL implementation using MLX☆28Updated last year
- ☆20Updated 4 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆60Updated 8 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆46Updated 3 months ago
- Fast reinforcement learning 💨☆24Updated 2 weeks ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆30Updated 5 months ago
- Training AI for Super Smash Bros. Melee☆25Updated this week
- Implementation for robust ViT and scaled attention☆18Updated 5 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 2 months ago
- ☆49Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated 2 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆23Updated 2 months ago
- supporting pytorch FSDP for optimizers☆80Updated 3 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 2 weeks ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated 5 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆91Updated 3 weeks ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025)☆84Updated this week
- A Wadler--Lindig pretty printer for Python☆38Updated this week
- ☆47Updated 4 months ago
- Scalable and Stable Parallelization of Nonlinear RNNS☆14Updated 2 months ago
- Rust Implementation of micrograd☆51Updated 8 months ago
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.☆16Updated last month