sjunhongshen / DASHLinks

☆23

Alternatives and similar repositories for DASH

Users that are interested in DASH are comparing it to the libraries listed below

Sorting:

nick11roberts / XD
☆12Updated 3 years ago
mkhodak / relax
☆15Updated 3 years ago
gregorbachmann / scaling_mlps
☆51Updated last year
JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆30Updated 2 years ago
SamsungSAILMontreal / ghn3
Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]
☆36Updated 10 months ago
SamsungSAILMontreal / PAPA
Repository for the PopulAtion Parameter Averaging (PAPA) paper
☆26Updated last year
facebookresearch / ModelRatatouille
Recycling diverse models
☆45Updated 2 years ago
aryol / inductive-scratchpad
Implementation for our paper "How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad"
☆11Updated last year
fKunstner / noise-sgd-adam-sign
☆16Updated 2 years ago
js-d / sim_metric
☆36Updated last year
stanislavfort / dissect-git-re-basin
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆36Updated 2 years ago
ganguli-lab / degrees-of-freedom
☆37Updated 3 years ago
HayeonLee / MetaD2A
Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)
☆64Updated 11 months ago
google-deepmind / ssl_hsic
☆37Updated 11 months ago
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]
☆19Updated last month
shikaiqiu / compute-better-spent
☆53Updated 9 months ago
mfederici / dsit
Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"
☆25Updated 3 years ago
alexrame / diwa
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Updated 2 years ago
hlml / fortuitous_forgetting
☆19Updated 3 years ago
JeanKaddour / NoTrainNoGain
Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)
☆80Updated last year
JonasGeiping / dataaugs
☆18Updated 2 years ago
szc12153 / sparse_meta_tuning
Official implementation for Sparse MetA-Tuning (SMAT)
☆16Updated 3 weeks ago
google-research / growneuron
☆55Updated 11 months ago
smonsays / contrastive-meta-learning
Code accompanying the paper "A contrastive rule for meta-learning"
☆12Updated 8 months ago
YannDubs / Invariant-Self-Supervised-Learning
Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"
☆41Updated 2 years ago
samuelstanton / gnosis
Code to reproduce experiments from 'Does Knowledge Distillation Really Work' a paper which appeared in the 2021 NeurIPS proceedings.
☆33Updated last year
AvivNavon / DWSNets
Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]
☆89Updated last year
xu-ji / information-bottleneck
Deep Learning & Information Bottleneck
☆61Updated 2 years ago
microsoft / fnl_paper
Factorized Neural Layers
☆29Updated 2 years ago
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19Updated 2 years ago