mzguntalan / zephyr
Zephyr is a declarative neural network library on top of JAX allowing for easy and fast neural network designing, creation, and manipulation
☆34Updated last month
Alternatives and similar repositories for zephyr:
Users that are interested in zephyr are comparing it to the libraries listed below
- ☆27Updated 6 months ago
- A pure and fast NumPy implementation of Mamba with cache support.☆17Updated 7 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆95Updated 3 weeks ago
- Tensor library with autograd using only Rust's standard library☆65Updated 6 months ago
- 🧱 Modula software package☆132Updated this week
- Alpha-Zero Connect Four NN trained via self play☆13Updated 3 months ago
- Schedule free optimiser implemented in JAX using Optimistix☆14Updated 7 months ago
- Gpu benchmark☆50Updated 3 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆31Updated this week
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code☆36Updated 3 weeks ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆43Updated last month
- ☆146Updated last month
- Implementation of DreamerV3 in Pytorch☆42Updated 2 months ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated 3 months ago
- σ-GPT: A New Approach to Autoregressive Models☆61Updated 5 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 2 weeks ago
- supporting pytorch FSDP for optimizers☆75Updated last month
- DeMo: Decoupled Momentum Optimization☆171Updated last month
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead☆210Updated 2 weeks ago
- Efficient optimizers☆145Updated this week
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆14Updated this week
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆26Updated 2 months ago
- Exploration into the Firefly algorithm in Pytorch☆33Updated 4 months ago
- Implementation snake game based on Diffusion model☆79Updated last week
- A package for defining deep learning models using categorical algebraic expressions.☆58Updated 5 months ago
- Evaluating the Mamba architecture on the Othello game☆44Updated 8 months ago
- Focused on fast experimentation and simplicity☆64Updated 3 weeks ago
- A nano protein structure prediction model based on DeepMind's AlphaFold paper☆24Updated 7 months ago
- Cellular Automata Accelerated in JAX☆77Updated 2 months ago