amjadmajid / BabyTorch
BabyTorch is a minimalist deep-learning framework with an API similar to PyTorch's. Its minimalist design encourages learners to explore and understand the underlying algorithms and mechanics of deep learning. It is designed so that when learners are ready to switch to PyTorch, they only need to remove the word `baby`.
☆26 Updated 5 months ago
Alternatives and similar repositories for BabyTorch
Users who are interested in BabyTorch are comparing it to the libraries listed below.
- Cost aware hyperparameter tuning algorithm ☆172 Updated last year
- ☆283 Updated last year
- Minimal yet performant LLM examples in pure JAX ☆193 Updated last month
- Minimal but scalable implementation of large language models in JAX ☆35 Updated 2 months ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆293 Updated last year
- Synchronized Curriculum Learning for RL Agents ☆114 Updated 2 months ago
- ☆120 Updated 4 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆102 Updated last month
- A simple library for scaling up JAX programs ☆144 Updated this week
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox. ☆24 Updated last year
- Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University ☆102 Updated last month
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆321 Updated 3 months ago
- seqax = sequence modeling + JAX ☆168 Updated 3 months ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.… ☆104 Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆37 Updated last year
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA… ☆117 Updated this week
- ☆150 Updated last year
- Solve puzzles. Learn CUDA. ☆64 Updated last year
- A simple, performant and scalable JAX-based world modeling codebase ☆77 Updated last week
- Minimal, lightweight JAX implementations of popular models. ☆117 Updated this week
- Accelerated minigrid environments with JAX ☆151 Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆171 Updated 4 months ago
- a Jax quantization library ☆56 Updated this week
- 🧱 Modula software package ☆300 Updated 2 months ago
- A set of Python scripts that makes your experience on TPU better ☆54 Updated last month
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 Updated last year
- Schedule free optimiser implemented in JAX using Optimistix ☆15 Updated last year
- Jax/Flax rewrite of Karpathy's nanoGPT ☆62 Updated 2 years ago
- ☆118 Updated last week
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face. ☆25 Updated 8 months ago