wtong98 / mlp-icl

☆10

Alternatives and similar repositories for mlp-icl

Users that are interested in mlp-icl are comparing it to the libraries listed below

Sorting:

shikaiqiu / compute-better-spent
☆53Updated 7 months ago
sjunhongshen / DASH
☆22Updated 2 years ago
locuslab / orthogonal-convolutions
Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness
☆44Updated 4 years ago
thudzj / ELLA
Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')
☆16Updated 2 years ago
Niccolo-Ajroldi / plainLM
Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and…
☆12Updated last month
JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆30Updated last year
team-approx-bayes / bayesian-sam
Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.
☆25Updated last year
aks2203 / deep-thinking
A centralized place for deep thinking code and experiments
☆84Updated last year
IlanPrice / DCTpS
Code for testing DCT plus Sparse (DCTpS) networks
☆14Updated 3 years ago
aryol / inductive-scratchpad
Implementation for our paper "How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad"
☆11Updated 11 months ago
pnnl / torchntk
☆27Updated 2 years ago
gortizji / linearized-networks
Source code of "What can linearized neural networks actually say about generalization?
☆20Updated 3 years ago
xu-ji / information-bottleneck
Deep Learning & Information Bottleneck
☆60Updated last year
js-d / sim_metric
☆35Updated last year
tml-epfl / sgd-sparse-features
SGD with large step sizes learns sparse features [ICML 2023]
☆32Updated 2 years ago
mariuslindegaard / Intermediate_Neural_Collapse
(ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code
☆15Updated last year
google-deepmind / spectral_ssm
☆31Updated last year
MarlonBecker / MSAM
☆18Updated last year
tml-epfl / understanding-sam
Towards Understanding Sharpness-Aware Minimization [ICML 2022]
☆35Updated 2 years ago
pilancilab / Riemannian_Preconditioned_LoRA
source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"
☆24Updated 10 months ago
hlml / fortuitous_forgetting
☆19Updated 3 years ago
ablghtianyi / ICL_Modular_Arithmetic
☆18Updated last month
nick11roberts / XD
☆11Updated 2 years ago
chengxiang / LinearTransformer
Pytorch code for experiments on Linear Transformers
☆20Updated last year
stanislavfort / dissect-git-re-basin
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆36Updated 2 years ago
tfjgeorge / nngeometry-examples
Example code for the NNGeometry PyTorch library
☆10Updated 2 months ago
automl / is_mamba_capable_of_icl
☆18Updated last year
AndPotap / einsum-search
☆32Updated 7 months ago
allenbai01 / transformers-as-statisticians
☆31Updated last year
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆37Updated 2 years ago