☆19Jun 10, 2024Updated last year
Alternatives and similar repositories for REQ
Users that are interested in REQ are comparing it to the libraries listed below
Sorting:
- ☆12Oct 5, 2020Updated 5 years ago
- ☆16Apr 26, 2023Updated 2 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆16Apr 27, 2019Updated 6 years ago
- ☆13Mar 22, 2023Updated 2 years ago
- Cross-library augmentation toolbox supporting 300 operators over 8 libraries + AI transforms☆12Jan 11, 2022Updated 4 years ago
- ☆15Jul 24, 2022Updated 3 years ago
- ☆36Sep 23, 2022Updated 3 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆70Sep 25, 2024Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Sep 11, 2023Updated 2 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19May 19, 2019Updated 6 years ago
- Test-time-training on nearest neighbors for large language models☆49Apr 18, 2024Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Dec 22, 2025Updated 2 months ago
- Official PyTorch code release for Implicit Gradient Transport, NeurIPS'19☆21Jun 11, 2019Updated 6 years ago
- ☆25May 20, 2020Updated 5 years ago
- [ICML'20] Multi Steepest Descent (MSD) for robustness against the union of multiple perturbation models.☆25Jul 25, 2024Updated last year
- ☆35Jun 13, 2023Updated 2 years ago
- ☆36Jan 23, 2024Updated 2 years ago
- ☆37Dec 19, 2024Updated last year
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆48Apr 9, 2021Updated 4 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Jun 14, 2022Updated 3 years ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited☆37Dec 27, 2022Updated 3 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆29Sep 12, 2025Updated 5 months ago
- ☆38Jun 10, 2021Updated 4 years ago
- ☆52Jun 10, 2024Updated last year
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- Lane segmentation model trained with tensorflow implementation MobileNetV2 based U-Net☆11Mar 24, 2023Updated 2 years ago
- Repo for the paper "Bounding Training Data Reconstruction in Private (Deep) Learning".☆11Jun 16, 2023Updated 2 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- ☆10Aug 26, 2022Updated 3 years ago
- ☆12Aug 3, 2021Updated 4 years ago
- Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022☆10Jan 6, 2023Updated 3 years ago
- a fast implementation of BM25☆10Sep 15, 2022Updated 3 years ago
- Backprop with Low-Precision Activations☆11Oct 28, 2019Updated 6 years ago
- ☆16Oct 2, 2022Updated 3 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)☆51Jul 27, 2025Updated 7 months ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- ACL24☆11Jun 7, 2024Updated last year
- Accelerating Transfer Learning with Robust Neural Nets☆11Oct 2, 2020Updated 5 years ago