Efficient empirical NTKs in PyTorch
☆22 · Jun 13, 2022 · Updated 3 years ago
Alternatives and similar repositories for empirical-ntks
Users who are interested in empirical-ntks compare it to the libraries listed below.
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆78 · Sep 4, 2023 · Updated 2 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data. ☆10 · Feb 27, 2024 · Updated 2 years ago
- Code repo for the NeurIPS 2021 paper "Online Adaptation to Label Distribution Shift". ☆15 · Feb 15, 2023 · Updated 3 years ago
- New structural distributional shifts for evaluating graph models ☆15 · Oct 25, 2023 · Updated 2 years ago
- Code for steering and monitoring with concept vectors in LLMs. https://arxiv.org/abs/2502.03708 ☆21 · Aug 10, 2025 · Updated 6 months ago
- ☆35 · Feb 8, 2026 · Updated last month
- ☆25 · May 20, 2020 · Updated 5 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms ☆58 · Aug 15, 2023 · Updated 2 years ago
- ☆56 · Aug 14, 2020 · Updated 5 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆31 · Jan 31, 2023 · Updated 3 years ago
- ☆24 · Jun 7, 2021 · Updated 4 years ago
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations ☆30 · Jul 12, 2020 · Updated 5 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 · Apr 15, 2024 · Updated last year
- ☆38 · Jul 13, 2022 · Updated 3 years ago
- ☆10 · Nov 15, 2023 · Updated 2 years ago
- ☆12 · Feb 22, 2021 · Updated 5 years ago
- Exploring the minimal architecture required for coherent English language generation. ☆12 · Mar 5, 2025 · Updated last year
- ☆12 · Mar 13, 2025 · Updated 11 months ago
- ☆43 · Dec 1, 2025 · Updated 3 months ago
- ☆52 · Jun 10, 2024 · Updated last year
- ☆38 · Jun 10, 2021 · Updated 4 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language ☆43 · Feb 24, 2023 · Updated 3 years ago
- ☆16 · Oct 2, 2022 · Updated 3 years ago
- Official repository for "Stylized Adversarial Training" (TPAMI 2022) ☆11 · Dec 30, 2022 · Updated 3 years ago
- Low-rank Highway Networks ☆13 · Mar 11, 2016 · Updated 9 years ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes. ☆22 · Feb 19, 2026 · Updated 2 weeks ago
- ☆43 · May 23, 2023 · Updated 2 years ago
- A robust metric (robust fidelity) for XGNN (ICLR24) ☆12 · Jun 3, 2025 · Updated 9 months ago
- Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022 ☆11 · Sep 17, 2024 · Updated last year
- ☆42 · Mar 23, 2023 · Updated 2 years ago
- ☆12 · Oct 5, 2020 · Updated 5 years ago
- Codebase for the EMNLP 2021 paper "HittER: Hierarchical Transformers for Knowledge Graph Embeddings". ☆12 · Nov 1, 2021 · Updated 4 years ago
- The source code of "Empowering Language Understanding with Counterfactual Reasoning" (ACL'21) ☆11 · Sep 3, 2021 · Updated 4 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that go beyond RoPE in supporting n-dimensional inputs. ☆14 · Aug 8, 2025 · Updated 7 months ago
- A tool to create immutable algebraic data structures and visitors for Java (such as abstract syntax trees). ☆16 · Jan 12, 2026 · Updated last month
- Simple MoE - Day 17 of 365 Days of Repos ☆17 · Jan 17, 2025 · Updated last year
- ☆12 · Oct 5, 2022 · Updated 3 years ago
- PyTorch routines for (Ker)nel (Mac)hines ☆11 · Oct 10, 2025 · Updated 4 months ago
- Syng: A syntactic approach to concurrent separation logic with propositional ghost state, fully mechanized in Agda ☆12 · Nov 18, 2022 · Updated 3 years ago