RobertCsordas / onion_representations

☆11

Alternatives and similar repositories for onion_representations

Users who are interested in onion_representations are comparing it to the repositories listed below.

aks2203 / deep-thinking
A centralized place for deep thinking code and experiments
☆85 Updated last year
KihoPark / linear_rep_geometry
☆99 Updated 5 months ago
locuslab / edge-of-stability
☆70 Updated 7 months ago
automl / is_mamba_capable_of_icl
☆18 Updated last year
yilundu / ebm_compositionality
[NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models
☆45 Updated 2 years ago
EkdeepSLubana / MMC
Codebase for Mechanistic Mode Connectivity
☆15 Updated 2 years ago
GFNOrg / gfn-diffusion
☆35 Updated 3 months ago
google-research / jax-influence
☆60 Updated 3 years ago
yilundu / irem_code_release
ICML 2022: Learning Iterative Reasoning through Energy Minimization
☆46 Updated 2 years ago
DeqingFu / transformers-icl-second-order
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…
☆17 Updated 7 months ago
mansheej / icl-task-diversity
Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"
☆21 Updated 2 years ago
tml-epfl / sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆43 Updated last year
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆37 Updated 2 years ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59 Updated last year
AllanYangZhou / universal_neural_functional
☆51 Updated last year
bilal-chughtai / rep-theory-mech-interp
☆26 Updated 2 years ago
tung-nd / TNP-pytorch
Official implementation of Transformer Neural Processes
☆78 Updated 2 years ago
Johswald / awesome-hypernetworks
☆65 Updated 3 years ago
machine-discovery / deer
Parallelizing non-linear sequential models over the sequence length
☆52 Updated 3 weeks ago
ejmichaud / grokking-squared
☆26 Updated 2 years ago
gibipara92 / learning-explanations-hard-to-vary
Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …
☆39 Updated 4 years ago
GFNOrg / GFlowNet-EM
Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.
☆41 Updated last year
multimodal-interpretability / FIND
Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents
☆49 Updated 9 months ago
radarFudan / mamba-minimal-jax
☆31 Updated 7 months ago
hlml / fortuitous_forgetting
☆19 Updated 3 years ago
skolouri / TopoTrans
TopoTrans: Optimal Transport meets Topological Data Analysis
☆14 Updated 2 years ago
yilundu / comet
[NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts
☆62 Updated 2 years ago
aniruddhraghu / meta-pretraining
Code accompanying paper: Meta-Learning to Improve Pre-Training
☆37 Updated 3 years ago
erosenfeld / disagree_discrep
Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.
☆10 Updated last year
tristandeleu / jax-meta-learning
A collection of meta-learning algorithms in Jax
☆23 Updated 2 years ago