jordan-benjamin / pydra
Simple, flexible configuration in pure Python!
☆26 · Updated 2 months ago
Alternatives and similar repositories for pydra
Users who are interested in pydra are comparing it to the libraries listed below.
- seqax = sequence modeling + JAX ☆166 · Updated last month
- ☆277 · Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆129 · Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX ☆35 · Updated this week
- 🧱 Modula software package ☆231 · Updated 2 weeks ago
- Jax/Flax rewrite of Karpathy's nanoGPT ☆59 · Updated 2 years ago
- we got you bro ☆36 · Updated last year
- JAX Synergistic Memory Inspector ☆179 · Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆648 · Updated last week
- LoRA for arbitrary JAX models and functions ☆142 · Updated last year
- A simple library for scaling up JAX programs ☆143 · Updated 10 months ago
- ☆87 · Updated last year
- A MAD laboratory to improve AI architecture designs 🧪 ☆127 · Updated 8 months ago
- Train very large language models in Jax. ☆208 · Updated last year
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations ☆196 · Updated 3 years ago
- A puzzle to learn about prompting ☆132 · Updated 2 years ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆131 · Updated last year
- 🧠 Starter templates for doing interpretability research ☆73 · Updated 2 years ago
- git extension for {collaborative, communal, continual} model development ☆216 · Updated 9 months ago
- nanoGPT-like codebase for LLM training ☆103 · Updated 3 months ago
- Inference code for LLaMA models in JAX ☆118 · Updated last year
- Named Tensors for Legible Deep Learning in JAX ☆201 · Updated last week
- Minimal yet performant LLM examples in pure JAX ☆151 · Updated this week
- Experiment of using Tangent to autodiff triton ☆80 · Updated last year
- A library for unit scaling in PyTorch ☆130 · Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆156 · Updated 2 months ago
- Mechanistic Interpretability for Transformer Models ☆51 · Updated 3 years ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task ☆188 · Updated 2 years ago
- Stochastic Parameter Decomposition ☆37 · Updated this week
- A dataset of alignment research and code to reproduce it ☆77 · Updated 2 years ago