jiwoongim / Online-Hyperparameter-Optimization-by-real-time-recurrent-learningLinks
Online Hyperparameter Optimization
☆10Updated 4 years ago
Alternatives and similar repositories for Online-Hyperparameter-Optimization-by-real-time-recurrent-learning
Users that are interested in Online-Hyperparameter-Optimization-by-real-time-recurrent-learning are comparing it to the libraries listed below
Sorting:
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆25Updated 3 years ago
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation☆42Updated 4 years ago
- ☆11Updated 3 years ago
- ☆37Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- ☆62Updated 3 years ago
- An Empirical Study of Invariant Risk Minimization☆27Updated 5 years ago
- Code for paper "Can contrastive learning avoid shortcut solutions?" NeurIPS 2021.☆47Updated 3 years ago
- ☆18Updated 2 years ago
- ☆22Updated 2 years ago
- Fine-grained ImageNet annotations☆29Updated 5 years ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆30Updated 4 years ago
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆59Updated 3 years ago
- Code for the PAPA paper☆27Updated 2 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆33Updated last month
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Updated 4 years ago
- ☆25Updated 5 years ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)☆16Updated 2 years ago
- ☆20Updated 5 years ago
- Staged Training for Transformer Language Models☆32Updated 3 years ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆36Updated 3 years ago
- ☆27Updated 4 years ago
- ☆89Updated 2 months ago
- How certain is your transformer?☆25Updated 4 years ago
- Parameter Efficient Transfer Learning with Diff Pruning☆74Updated 4 years ago
- Improving Transformation Invariance in Contrastive Representation Learning☆13Updated 4 years ago
- "Predict, then Interpolate: A Simple Algorithm to Learn Stable Classifiers" ICML 2021☆18Updated 4 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…☆46Updated last year