[NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228
☆20Jan 7, 2022Updated 4 years ago
Alternatives and similar repositories for loss_landscape_taxonomy
Users that are interested in loss_landscape_taxonomy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2020] code for "Boundary thickness and robustness in learning models"☆20Dec 11, 2020Updated 5 years ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆34Jun 9, 2025Updated 11 months ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- Benchmarking Semi-supervised Federated Learning☆54Aug 14, 2022Updated 3 years ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 5 months ago
- Codes for "Two Sides of the Same Coin: Heterophily and Oversmoothing in Graph Convolutional Neural Networks"☆43Mar 18, 2023Updated 3 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 11 months ago
- Code Repository for the NeurIPS 2021 paper: "Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic P…☆22Jul 10, 2024Updated last year
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11May 1, 2026Updated last week
- Localizing Memorized Sequences in Language Models☆22Oct 15, 2025Updated 6 months ago
- The official repository for AdaMuon☆38Aug 27, 2025Updated 8 months ago
- Mobilint Model Zoo Project☆20Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- [NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh…☆21Oct 1, 2022Updated 3 years ago
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 6 months ago
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization☆12Nov 29, 2024Updated last year
- Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf☆38Aug 23, 2018Updated 7 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Jun 14, 2022Updated 3 years ago
- ☆19Apr 16, 2025Updated last year
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- FedUL: Federated Learning from Only Unlabeled Data with Class-Conditional-Sharing Clients☆33Jul 11, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- HKBU PhD/MPhil thesis template as well as key steps for submission☆37Sep 27, 2022Updated 3 years ago
- ☆18Jan 17, 2024Updated 2 years ago
- ☆14Dec 21, 2024Updated last year
- translation of VHL repo in paddle☆25Jun 28, 2023Updated 2 years ago
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. AAAI, 2025☆14Aug 25, 2025Updated 8 months ago
- Source code for the paper "Source of Transfer in Multilingual Named Entity Recognition"☆12Dec 8, 2022Updated 3 years ago
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 4 months ago
- ☆10Sep 7, 2022Updated 3 years ago
- [ICLR 2023, Spotlight] Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning☆31Dec 2, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Oct 31, 2022Updated 3 years ago
- PyTorch implementation of LAMB for ImageNet/ResNet-50 training☆13May 13, 2021Updated 4 years ago
- Learning with Noisy Labels, Label Noise, ICML 2021☆46Mar 28, 2023Updated 3 years ago
- [ICML 2024 Oral] LSH-Based Efficient Point Transformer (HEPT)☆25Jan 24, 2025Updated last year
- ICML2022: Virtual Homogeneity Learning: Defending against Data Heterogeneity in Federated Learning☆41Oct 9, 2022Updated 3 years ago
- Use GCN to classify Mnist☆11Mar 19, 2020Updated 6 years ago
- ☆15Mar 12, 2024Updated 2 years ago