andbloch / eth-dl-cheat-sheet
Cheat sheet for the "Deep Learning" course at ETH Zürich
☆20Updated 5 years ago
Alternatives and similar repositories for eth-dl-cheat-sheet:
Users that are interested in eth-dl-cheat-sheet are comparing it to the libraries listed below
- Fast training of unitary deep network layers from low-rank updates☆28Updated 2 years ago
- Gradient-based constrained optimization for JAX☆28Updated 2 years ago
- Omnigrok: Grokking Beyond Algorithmic Data☆51Updated last year
- 🌲 Stanford CS 228 - Probabilistic Graphical Models☆116Updated 4 months ago
- Sampling with gradient-based Markov Chain Monte Carlo approaches☆94Updated 8 months ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆67Updated 2 years ago
- ☆123Updated last week
- Parameter-Free Optimizers for Pytorch☆109Updated 8 months ago
- ☆59Updated 3 years ago
- A general-purpose, deep learning-first library for constrained optimization in PyTorch☆110Updated this week
- SR based on LLMs.☆92Updated 2 years ago
- Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, Neurips 2022)☆64Updated 5 months ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆58Updated 3 years ago
- Ying Nian Wu's UCLA Statistical Machine Learning Tutorial on generative modeling.☆54Updated 2 years ago
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.☆35Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆58Updated 3 months ago
- ☆34Updated last month
- ☆35Updated last year
- [ICLR 2024 Spotlight] This is the official code for the paper "SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-t…☆50Updated 2 months ago
- PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆34Updated 3 years ago
- Neural Networks and the Chomsky Hierarchy☆194Updated 9 months ago
- ☆27Updated last year
- symbolic regression☆36Updated 2 years ago
- Course repository for the Spring COMP790 course "Deep Learning" at UNC☆23Updated 2 years ago
- Resources from the EleutherAI Math Reading Group☆52Updated last month
- Sparse and discrete interpretability tool for neural networks☆58Updated 11 months ago
- Efficiently Composable Data Augmentation on the GPU with Jax☆32Updated 6 months ago
- Autoregressive Models in PyTorch.☆77Updated 2 years ago
- Repository for my Big Data Optimization course☆33Updated 3 years ago