[NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228
☆20Jan 7, 2022Updated 4 years ago
Alternatives and similar repositories for loss_landscape_taxonomy
Users that are interested in loss_landscape_taxonomy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2020] code for "Boundary thickness and robustness in learning models"☆20Dec 11, 2020Updated 5 years ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- Code for Neural Execution Engines: Learning to Execute Subroutines☆18Jan 11, 2021Updated 5 years ago
- ☆19Nov 10, 2024Updated last year
- This framework implements key experiments on the sparse double descent phenomenon (ICML 2022).☆15Dec 13, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 6 months ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11May 1, 2026Updated 3 weeks ago
- Localizing Memorized Sequences in Language Models☆22Oct 15, 2025Updated 7 months ago
- The official repository for AdaMuon☆39Aug 27, 2025Updated 9 months ago
- Mobilint Model Zoo Project☆20Updated this week
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆19Dec 15, 2023Updated 2 years ago
- H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts☆29Feb 20, 2026Updated 3 months ago
- Official PyTorch implementation of NeuralSVD (ICML 2024)☆24Sep 14, 2024Updated last year
- [NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh…☆21Oct 1, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ICML 2022 code for "Neurotoxin: Durable Backdoors in Federated Learning" https://arxiv.org/abs/2206.10341☆85Apr 1, 2023Updated 3 years ago
- ☆11Jul 21, 2024Updated last year
- fast trainer for educational purposes☆26May 4, 2026Updated 3 weeks ago
- Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf☆38Aug 23, 2018Updated 7 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Jun 14, 2022Updated 3 years ago
- 2019年讯飞开发者大赛应用分类标注赛第一名解决方案☆12Oct 23, 2019Updated 6 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆39Mar 2, 2023Updated 3 years ago
- Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020)☆28Dec 22, 2020Updated 5 years ago
- [ICML2024] "FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees" by Jiaha…☆14Sep 22, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Distilling Task-Specific Knowledge from BERT into Simple Neural Networks.☆15Aug 28, 2020Updated 5 years ago
- PyTorch implementation for "Gradient Surgery for Multi-Task Learning" https://arxiv.org/abs/2001.06782☆14Jul 6, 2020Updated 5 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 4 years ago
- FedUL: Federated Learning from Only Unlabeled Data with Class-Conditional-Sharing Clients☆33Jul 11, 2023Updated 2 years ago
- HKBU PhD/MPhil thesis template as well as key steps for submission☆37Sep 27, 2022Updated 3 years ago
- Code for Unsupervised Multi-Target Domain Adaptation: An Information Theoretic Approach☆14Jul 19, 2020Updated 5 years ago
- translation of VHL repo in paddle☆25Jun 28, 2023Updated 2 years ago
- NeurIPS'22 Oral: EquiVSet - Learning Neural Set Functions Under the Optimal Subset Oracle☆22Dec 23, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. AAAI, 2025☆15Aug 25, 2025Updated 9 months ago
- Source code for the paper "Source of Transfer in Multilingual Named Entity Recognition"☆12Dec 8, 2022Updated 3 years ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆33Mar 11, 2025Updated last year
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 4 months ago
- ☆10Sep 7, 2022Updated 3 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 7 years ago
- ☆14Dec 21, 2024Updated last year