amjadmajid / BabyTorch
BabyTorch is a minimalist deep-learning framework with an API similar to PyTorch's. The minimalist design encourages learners to explore and understand the underlying algorithms and mechanics of deep learning. It is designed so that when learners are ready to switch to PyTorch, they only need to remove the word `baby`.
☆26 · Updated 11 months ago
Alternatives and similar repositories for BabyTorch:
Users interested in BabyTorch are comparing it to the libraries listed below.
- Implementation of Diffusion Transformer (DiT) in JAX ☆272 · Updated 10 months ago
- Official JAX implementation of xLSTM, including fast and efficient training and inference code. 7B model available at https://huggingface.… ☆91 · Updated 3 months ago
- ☆216 · Updated 9 months ago
- ☆175 · Updated 4 months ago
- Custom Triton kernels for training Karpathy's nanoGPT ☆18 · Updated 6 months ago
- ☆150 · Updated 8 months ago
- Efficient optimizers ☆189 · Updated this week
- seqax = sequence modeling + JAX ☆154 · Updated 2 weeks ago
- ☆102 · Updated this week
- Accelerated minigrid environments with JAX ☆134 · Updated 8 months ago
- An implementation of the PSGD Kron second-order optimizer for PyTorch ☆89 · Updated 3 weeks ago
- Accelerated First Order Parallel Associative Scan ☆181 · Updated 8 months ago
- Just some miscellaneous utility functions / decorators / modules related to PyTorch and Accelerate to help speed up implementation of new… ☆120 · Updated 8 months ago
- A set of Python scripts that make your experience on TPU better ☆51 · Updated 9 months ago
- 🧱 Modula software package ☆188 · Updated 3 weeks ago
- Minimal but scalable implementation of large language models in JAX ☆34 · Updated 5 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆178 · Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs ☆105 · Updated 5 months ago
- Supporting PyTorch FSDP for optimizers ☆80 · Updated 4 months ago
- PyTorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University ☆57 · Updated this week
- Deep learning library implemented from scratch in NumPy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments. ☆51 · Updated last year
- ☆27 · Updated 9 months ago
- Large-scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆82 · Updated last year
- Cost-aware hyperparameter tuning algorithm ☆150 · Updated 9 months ago
- ☆78 · Updated 9 months ago
- LoRA for arbitrary JAX models and functions ☆136 · Updated last year
- The AdEMAMix Optimizer: Better, Faster, Older ☆180 · Updated 7 months ago
- ☆94 · Updated 3 months ago
- ☆87 · Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆81 · Updated last month