JmlrOrg / dmlr-style-fileLinks
☆12Updated 2 years ago
Alternatives and similar repositories for dmlr-style-file
Users that are interested in dmlr-style-file are comparing it to the libraries listed below
Sorting:
- ☆34Updated 2 years ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆106Updated 2 years ago
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆17Updated 2 years ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆20Updated last year
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…☆46Updated 2 years ago
- ZeroC is a neuro-symbolic method that trained with elementary visual concepts and relations, can zero-shot recognize and acquire more com…☆33Updated 2 years ago
- Efficient empirical NTKs in PyTorch☆22Updated 3 years ago
- Blog post☆17Updated last year
- TopoTrans: Optimal Transport meets Topological Data Analysis☆14Updated 2 years ago
- ☆33Updated last year
- Parallelizing non-linear sequential models over the sequence length☆56Updated 6 months ago
- ☆12Updated last year
- ☆20Updated 2 months ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated 2 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Updated last year
- Code for "Bayesian Structure Learning with Generative Flow Networks"☆94Updated 3 years ago
- ☆23Updated 11 months ago
- ☆73Updated last year
- ☆18Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Updated 2 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆16Updated 3 years ago
- ☆22Updated 4 years ago
- ☆107Updated last year
- ☆37Updated 2 years ago
- ☆38Updated last year
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆41Updated 5 years ago
- ☆25Updated last year
- Self-Supervised Alignment with Mutual Information☆20Updated last year
- ☆31Updated 9 months ago
- ☆27Updated 2 years ago