jiwoongim / Online-Hyperparameter-Optimization-by-real-time-recurrent-learningLinks
Online Hyperparameter Optimization
☆11Updated 4 years ago
Alternatives and similar repositories for Online-Hyperparameter-Optimization-by-real-time-recurrent-learning
Users that are interested in Online-Hyperparameter-Optimization-by-real-time-recurrent-learning are comparing it to the libraries listed below
Sorting:
- Code for paper "Can contrastive learning avoid shortcut solutions?" NeurIPS 2021.☆47Updated 3 years ago
- https://arxiv.org/abs/2102.12594☆14Updated 2 years ago
- Official repository for Fourier model that can generate periodic signals☆10Updated 3 years ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆25Updated 3 years ago
- Gradient-based Hyperparameter Optimization Over Long Horizons☆14Updated 4 years ago
- ☆38Updated 4 years ago
- ☆32Updated last week
- Parameter Efficient Transfer Learning with Diff Pruning☆74Updated 4 years ago
- ☆34Updated 5 months ago
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation☆41Updated 5 years ago
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…☆46Updated 2 years ago
- ☆18Updated 3 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- ☆62Updated 3 years ago
- A simple Jax implementation of influence functions.☆18Updated last year
- ☆96Updated 3 years ago
- "Predict, then Interpolate: A Simple Algorithm to Learn Stable Classifiers" ICML 2021☆18Updated 4 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Updated 4 years ago
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆59Updated 3 years ago
- Weighted Training for Cross-Task Learning☆15Updated 2 years ago
- Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients☆32Updated 3 years ago
- How certain is your transformer?☆25Updated 4 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Updated 5 months ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Updated 5 years ago
- Group-conditional DRO to alleviate spurious correlations☆15Updated 4 years ago
- ☆17Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 3 years ago
- OOD Generalization and Detection (ACL 2020)☆60Updated 5 years ago
- ☆14Updated 3 years ago
- Improving Transformation Invariance in Contrastive Representation Learning☆13Updated 4 years ago