cxy1997 / graphite-utils
☆26Updated 11 months ago
Alternatives and similar repositories for graphite-utils:
Users who are interested in graphite-utils are comparing it to the repositories listed below
- Instructions for using the graphite cluster☆22Updated 5 years ago
- ☆157Updated 2 years ago
- ☆65Updated 3 months ago
- ☆217Updated 10 months ago
- ☆20Updated 4 years ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆22Updated last year
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆38Updated 5 years ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆140Updated 7 months ago
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations☆30Updated 4 years ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆45Updated last month
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆104Updated last year
- [ICML 2021] This is the official GitHub repo for training L_inf dist nets with high certified accuracy.☆41Updated 3 years ago
- Reparameterize your PyTorch modules☆70Updated 4 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆105Updated 4 years ago
- ☆81Updated 7 months ago
- A centralized place for deep thinking code and experiments☆82Updated last year
- In Defense of the Unitary Scalarization for Deep Multi-Task Learning☆21Updated 2 years ago
- ☆28Updated 8 months ago
- Template and style files for ICLR☆185Updated 7 months ago
- ☆40Updated 2 years ago
- ☆60Updated 3 years ago
- Parameter Efficient Transfer Learning with Diff Pruning☆73Updated 4 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- This package implements THOR: Transformer with Stochastic Experts.☆62Updated 3 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆74Updated last year
- Code for "Stochastic Optimization of Sorting Networks using Continuous Relaxations", ICLR 2019.☆139Updated 2 years ago
- ☆59Updated 2 years ago
- Repository containing PyTorch code for EKFAC and K-FAC preconditioners.☆141Updated last year
- ☆81Updated last year
- [NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features☆55Updated 2 years ago