thetechdude124 / Adam-Optimization-From-ScratchLinks
📈Implementing the ADAM optimizer from the ground up with PyTorch and comparing its performance on six 3-D objective functions (each progressively more difficult to optimize) against SGD, AdaGrad, and RMSProp.
☆21Updated 3 years ago
Alternatives and similar repositories for Adam-Optimization-From-Scratch
Users that are interested in Adam-Optimization-From-Scratch are comparing it to the libraries listed below
Sorting:
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆45Updated last year
- Toy genetic algorithm in Pytorch☆52Updated 5 months ago
- ☆28Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Updated last year
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆14Updated last year
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 9 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- Rust Implementation of micrograd☆53Updated last year
- ☆45Updated 5 months ago
- Because we don't want a jupyter notebook mess...☆61Updated 4 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆21Updated 3 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆72Updated 3 months ago
- Tutorial for Harvard Medical School ML from Scratch Series: Transformer from Scratch. Demo the usage of transformer in various domains: M…☆45Updated 2 years ago
- A simple example of VAEs with KANs☆12Updated last year
- Code associated to papers on superposition (in ML interpretability)☆33Updated 3 years ago
- Quantum Dynamical Hamiltonian Monte Carlo☆34Updated last year
- alternative way to calculating self attention☆18Updated last year
- ☆15Updated 3 years ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆133Updated 8 months ago
- Fine-grained, dynamic control of neural network topology in JAX.☆21Updated 2 years ago
- ☆30Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆56Updated 2 weeks ago
- a decentralized dataset generator and manipulator.☆11Updated last week
- This is the code that went into our practical dive using mamba as information extraction☆55Updated last year
- ☆61Updated 3 months ago
- TensorFlow implementation of involution.☆13Updated 4 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆59Updated last year
- Simple repository for training small reasoning models☆40Updated 8 months ago
- aesthetic tensor visualiser☆27Updated 5 months ago