amjadmajid / BabyTorchLinks
BabyTorch is a minimalist deep-learning framework with a similar API to PyTorch. This minimalist design encourages learners explore and understand the underlying algorithms and mechanics of deep learning processes. It is design such that when learners are ready to switch to PyTorch they only need to remove the word `baby`.
☆26Updated 7 months ago
Alternatives and similar repositories for BabyTorch
Users that are interested in BabyTorch are comparing it to the libraries listed below
Sorting:
- ☆123Updated 7 months ago
- Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University☆103Updated 4 months ago
- Minimal yet performant LLM examples in pure JAX☆236Updated 2 weeks ago
- Cost aware hyperparameter tuning algorithm☆177Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆306Updated last year
- ☆135Updated last month
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Updated last year
- ☆289Updated last year
- Minimal, lightweight JAX implementations of popular models.☆180Updated this week
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆107Updated 2 months ago
- Synchronized Curriculum Learning for RL Agents☆119Updated 2 months ago
- A simple library for scaling up JAX programs☆144Updated 2 months ago
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.☆26Updated 11 months ago
- 🧱 Modula software package☆322Updated 5 months ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆334Updated 3 weeks ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆63Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆37Updated 2 years ago
- ☆215Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆349Updated 2 months ago
- A simple, performant and scalable JAX-based world modeling codebase.☆123Updated 2 weeks ago
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Updated last year
- seqax = sequence modeling + JAX☆170Updated 6 months ago
- Accelerated minigrid environments with JAX☆156Updated 3 months ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆298Updated last year
- Einsum-like high-level array sharding API for JAX☆34Updated last year
- Notebooks for the "Deep Learning with JAX" book☆168Updated 7 months ago
- LoRA for arbitrary JAX models and functions☆144Updated last year
- If it quacks like a tensor...☆59Updated last year
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆105Updated last year