wilson1yan / cs294-158-ssl
☆14Updated 2 years ago
Alternatives and similar repositories for cs294-158-ssl:
Users that are interested in cs294-158-ssl are comparing it to the libraries listed below
- ☆19Updated 2 years ago
- ☆36Updated 3 years ago
- Code for "Supermasks in Superposition"☆121Updated last year
- ☆95Updated 2 years ago
- Layerwise Batch Entropy Regularization☆22Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- ☆35Updated last year
- Collection of snippets for PyTorch users☆25Updated 3 years ago
- Official code for the paper: "Metadata Archaeology"☆19Updated last year
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆80Updated 3 years ago
- NEVIS'22: Benchmarking the next generation of never-ending learners☆101Updated 2 years ago
- ☆34Updated 3 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- Cyclemoid implementation for PyTorch☆87Updated 2 years ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆35Updated 2 years ago
- Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi …☆63Updated 4 years ago
- A centralized place for deep thinking code and experiments☆82Updated last year
- ☆108Updated last year
- Bayesianize: A Bayesian neural network wrapper in pytorch☆88Updated 10 months ago
- Personal implementation of ASIF by Antonio Norelli☆25Updated 9 months ago
- Adversarial examples to the new ConvNeXt architecture☆20Updated 3 years ago
- ☆73Updated 2 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- CIFAR-5m dataset☆38Updated 4 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆95Updated last year
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Updated 4 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆31Updated 4 years ago
- This repository contains a Jax implementation of conformal training corresponding to the ICLR'22 paper "learning optimal conformal classi…☆129Updated 2 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago