☆19 · Jun 10, 2024 · Updated last year
Alternatives and similar repositories for REQ
Users that are interested in REQ are comparing it to the libraries listed below
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data. ☆10 · Feb 27, 2024 · Updated 2 years ago
- ☆16 · Apr 26, 2023 · Updated 2 years ago
- ☆12 · Oct 5, 2020 · Updated 5 years ago
- ☆16 · Dec 9, 2023 · Updated 2 years ago
- ☆15 · Jul 24, 2022 · Updated 3 years ago
- Cross-library augmentation toolbox supporting 300 operators over 8 libraries + AI transforms ☆12 · Jan 11, 2022 · Updated 4 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆71 · Sep 25, 2024 · Updated last year
- ☆13 · Mar 22, 2023 · Updated 2 years ago
- ☆35 · Sep 23, 2022 · Updated 3 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019) ☆16 · Apr 27, 2019 · Updated 6 years ago
- A School for All Seasons on Trustworthy Machine Learning ☆12 · Jun 30, 2021 · Updated 4 years ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024] ☆52 · Dec 22, 2025 · Updated 2 months ago
- ☆36 · Jan 23, 2024 · Updated 2 years ago
- ☆25 · May 20, 2020 · Updated 5 years ago
- RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network ☆15 · Oct 18, 2022 · Updated 3 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆44 · Sep 11, 2023 · Updated 2 years ago
- a fast implementation of BM25 ☆10 · Sep 15, 2022 · Updated 3 years ago
- Code for T-MARS data filtering ☆35 · Aug 23, 2023 · Updated 2 years ago
- Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022 ☆10 · Jan 6, 2023 · Updated 3 years ago
- Test-time-training on nearest neighbors for large language models ☆49 · Apr 18, 2024 · Updated last year
- [CVPR 2024] This repository includes the official implementation of our paper "Revisiting Adversarial Training at Scale" ☆20 · Apr 21, 2024 · Updated last year
- Official PyTorch code release for Implicit Gradient Transport, NeurIPS'19 ☆21 · Jun 11, 2019 · Updated 6 years ago
- Fine-grained ImageNet annotations ☆30 · May 25, 2020 · Updated 5 years ago
- [ICML'20] Multi Steepest Descent (MSD) for robustness against the union of multiple perturbation models. ☆25 · Jul 25, 2024 · Updated last year
- Machine learning project using federated learning for text generation ☆11 · May 5, 2024 · Updated last year
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size ☆19 · May 19, 2019 · Updated 6 years ago
- Experimental scripts for researching data adaptive learning rate scheduling. ☆22 · Oct 18, 2023 · Updated 2 years ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited ☆37 · Dec 27, 2022 · Updated 3 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions ☆10 · May 16, 2018 · Updated 7 years ago
- ☆17 · Dec 19, 2024 · Updated last year
- A framework for steering MoE models by detecting and controlling behavior-linked experts. ☆30 · Sep 12, 2025 · Updated 6 months ago
- some mixture of experts architecture implementations ☆26 · Mar 22, 2024 · Updated last year
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 7 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs" ☆10 · Dec 30, 2024 · Updated last year
- Federated Learning - PyTorch ☆15 · Jun 27, 2021 · Updated 4 years ago
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019) ☆14 · Nov 26, 2019 · Updated 6 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 · Apr 15, 2024 · Updated last year
- ☆38 · Dec 19, 2024 · Updated last year
- Code for Semi-crowdsourced Clustering with Deep Generative Models ☆12 · Dec 9, 2022 · Updated 3 years ago