Interactive visualization of 5 popular gradient descent methods with step-by-step illustration and hyperparameter tuning UI
☆1,401 · Aug 4, 2024 · Updated last year
Alternatives and similar repositories for gradient_descent_viz
Users that are interested in gradient_descent_viz are comparing it to the libraries listed below.
- Deep Reinforcement Learning: Zero to Hero! ☆2,290 · May 26, 2026 · Updated 3 weeks ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs ☆210 · Sep 12, 2024 · Updated last year
- ✨ rudimentary simulation of the three-body problem ☆158 · Apr 2, 2024 · Updated 2 years ago
- ☆4,121 · Apr 15, 2026 · Updated 2 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,893 · Jun 22, 2025 · Updated 11 months ago
- llama3 implementation one matrix multiplication at a time ☆15,229 · May 23, 2024 · Updated 2 years ago
- Optimum graph creation and distribution for underground networks. ☆34 · Jun 24, 2024 · Updated last year
- ☆1,088 · May 18, 2025 · Updated last year
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping". ☆376 · Jun 11, 2024 · Updated 2 years ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ☆35,826 · Updated this week
- Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL, Itakura-Saito, L1, etc). 6 algorithms, 740 … ☆342 · Feb 14, 2026 · Updated 4 months ago
- LLM Analytics ☆714 · Oct 19, 2024 · Updated last year
- Visualizer for neural network, deep learning and machine learning models ☆33,086 · Updated this week
- A playbook for systematically maximizing the performance of deep learning models. ☆30,190 · Jun 18, 2024 · Updated 2 years ago
- llama3.np is a pure NumPy implementation for Llama 3 model. ☆992 · Apr 27, 2025 · Updated last year
- A guide on how to understand the performance of your battery with modelling and improve it ☆423 · Jul 28, 2024 · Updated last year
- High-Performance Implementation of OpenAI's TikToken. ☆475 · Jul 3, 2025 · Updated 11 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆59,723 · Nov 12, 2025 · Updated 7 months ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆161,694 · Updated this week
- Animation engine for explanatory math videos ☆87,573 · Apr 18, 2026 · Updated 2 months ago
- Richard is gaining power ☆200 · Jun 21, 2025 · Updated 11 months ago
- Animating R1's thoughts. ☆380 · Feb 17, 2025 · Updated last year
- Distribute and run LLMs with a single file. ☆24,950 · Jun 9, 2026 · Updated last week
- Solve puzzles. Improve your pytorch. ☆4,147 · Jul 15, 2024 · Updated last year
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ☆24,533 · Aug 15, 2024 · Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆42,508 · Updated this week
- A modern model graph visualizer and debugger ☆1,503 · Updated this week
- LLM101n: Let's build a Storyteller ☆37,316 · Aug 1, 2024 · Updated last year
- Grow virtual creatures in static and physics simulated environments. ☆53 · Mar 20, 2024 · Updated 2 years ago
- Inference Llama 2 in one file of pure C ☆19,631 · Aug 6, 2024 · Updated last year
- LLM inference in C/C++ ☆116,603 · Updated this week
- A pure NumPy implementation of Mamba. ☆221 · Jul 8, 2024 · Updated last year
- LLM training in simple, raw C/CUDA ☆30,209 · Jun 26, 2025 · Updated 11 months ago
- A library for efficient similarity search and clustering of dense vectors. ☆40,309 · Updated this week
- Machine Learning Engineering Open Book ☆18,137 · May 18, 2026 · Updated last month
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --… ☆36,885 · Jun 3, 2026 · Updated 2 weeks ago
- Google Research ☆38,126 · Updated this week
- Solve puzzles. Learn CUDA. ☆12,231 · Sep 1, 2024 · Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,573 · Jul 1, 2024 · Updated last year