andbloch / eth-dl-cheat-sheet
Cheat sheet for the "Deep Learning" course at ETH Zürich
☆20Updated 5 years ago
Alternatives and similar repositories for eth-dl-cheat-sheet:
Users that are interested in eth-dl-cheat-sheet are comparing it to the libraries listed below
- WandB sweeps integration with Hydra sweeper☆47Updated last year
- Repository for my Big Data Optimization course☆34Updated 4 years ago
- Meta Optimal Transport☆98Updated last year
- Interactive textbook on state-space models☆182Updated last year
- Temporal Predictive Coding For Model-Based Planning In Latent Space (ICML-2021)☆13Updated 6 months ago
- Codebase for Mechanistic Mode Connectivity☆13Updated last year
- PyTorch Package For Quasimetric Learning☆41Updated 3 months ago
- Code for A General Recipe for Likelihood-free Bayesian Optimization, ICML 2022☆44Updated 2 years ago
- This repository contains a Jax implementation of conformal training corresponding to the ICLR'22 paper "learning optimal conformal classi…☆129Updated 2 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆59Updated 3 years ago
- Fast training of unitary deep network layers from low-rank updates☆28Updated 2 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆104Updated 4 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆72Updated 2 years ago
- Code repository of the paper "CITRIS: Causal Identifiability from Temporal Intervened Sequences" and "iCITRIS: Causal Representation Lear…☆50Updated last year
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Transformers with doubly stochastic attention☆45Updated 2 years ago
- The Modified Differential Multiplier Method (MDMM) for PyTorch☆56Updated 3 years ago
- ☆31Updated 2 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆112Updated last year
- ☆42Updated last year
- Gradient-based constrained optimization for JAX☆29Updated 2 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- LaTeX style file for the Journal of Machine Learning Research☆121Updated 7 months ago
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆99Updated 2 months ago
- ☆35Updated last year
- ☆30Updated 2 months ago
- 🌲 Stanford CS 228 - Probabilistic Graphical Models☆118Updated 5 months ago
- Codes for the paper "A mathematical perspective on Transformers".☆34Updated 7 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Beyond Straight-Through☆93Updated last year