amjadmajid / BabyTorchLinks
BabyTorch is a minimalist deep-learning framework with a similar API to PyTorch. This minimalist design encourages learners explore and understand the underlying algorithms and mechanics of deep learning processes. It is design such that when learners are ready to switch to PyTorch they only need to remove the word `baby`.
☆26Updated 3 months ago
Alternatives and similar repositories for BabyTorch
Users that are interested in BabyTorch are comparing it to the libraries listed below
Sorting:
- Cost aware hyperparameter tuning algorithm☆168Updated last year
- Minimal yet performant LLM examples in pure JAX☆160Updated this week
- Implementation of Diffusion Transformer (DiT) in JAX☆291Updated last year
- ☆120Updated 3 months ago
- 🧱 Modula software package☆239Updated last month
- Efficient optimizers☆261Updated this week
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆301Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆164Updated 3 months ago
- A simple library for scaling up JAX programs☆143Updated 10 months ago
- ☆281Updated last year
- Jax/Flax rewrite of Karpathy's nanoGPT☆60Updated 2 years ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆98Updated 6 months ago
- Synchronized Curriculum Learning for RL Agents☆113Updated last month
- ☆150Updated last year
- ☆214Updated 9 months ago
- Minimal but scalable implementation of large language models in JAX☆35Updated 3 weeks ago
- Minimal, lightweight JAX implementations of popular models.☆109Updated this week
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆103Updated 8 months ago
- Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University☆98Updated 3 weeks ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch☆96Updated 2 months ago
- Accelerated minigrid environments with JAX☆147Updated 3 weeks ago
- 📄Small Batch Size Training for Language Models☆62Updated this week
- Implementation of the Llama architecture with RLHF + Q-learning☆167Updated 7 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Updated 11 months ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆311Updated this week
- supporting pytorch FSDP for optimizers☆84Updated 9 months ago
- Annotated version of the Mamba paper☆490Updated last year
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster☆70Updated 4 months ago
- ☆111Updated 2 weeks ago