wenwei202/smoothout

SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning

☆23

Alternatives and similar repositories for smoothout

Users who are interested in smoothout are comparing it to the repositories listed below.

keskarnitish / large-batch-training
Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"
☆147 · Apr 24, 2017 · Updated 9 years ago
fdlm / Spaghetti
Conditional Random Fields implemented as Lasagne layer
☆10 · Jul 22, 2016 · Updated 10 years ago
sleepinyourhat / quora-duplicate-questions-util
Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.
☆14 · Jan 27, 2017 · Updated 9 years ago
bneyshabur / over-parametrization
Computing various norms/measures on over-parametrized neural networks
☆50 · Nov 26, 2018 · Updated 7 years ago
okn-yu / Visualizing-the-Loss-Landscape-of-Neural-Nets
☆19 · Jul 15, 2020 · Updated 6 years ago
Abhishaike / HyperProtoNetReproduce
NeurIPS 2019 Paper Implementation
☆12 · Nov 22, 2022 · Updated 3 years ago
devansh20la / LPF-SGD
☆17 · Dec 11, 2022 · Updated 3 years ago
shinseung428 / image_control_TF
☆13 · Feb 17, 2018 · Updated 8 years ago
alex-damian / EOS
☆15 · Sep 29, 2022 · Updated 3 years ago
renmengye / meta-optim-public
Understanding Short-Horizon Bias in Stochastic Meta-Optimization
☆37 · Mar 8, 2018 · Updated 8 years ago
llan-ml / MetaTNE
Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"
☆10 · Nov 17, 2020 · Updated 5 years ago
s11y / Gomenna-SideStep
☆10 · Aug 18, 2016 · Updated 9 years ago
wenwei202 / autogrow
AutoGrow: Automatic Layer Growing in Deep Convolutional Networks (KDD 2020)
☆40 · Jun 10, 2019 · Updated 7 years ago
facebookresearch / GAN-optimization-landscape
code to reproduce the empirical results in the research paper
☆40 · Oct 12, 2021 · Updated 4 years ago
flowersteam / Unsupervised_Goal_Space_Learning
Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"
☆21 · Feb 14, 2018 · Updated 8 years ago
962086838 / code-for-Asymmetric-Valley
This repo contains the code used for NeurIPS 2019 paper "Asymmetric Valleys: Beyond Sharp and Flat Local Minima".
☆14 · Oct 25, 2019 · Updated 6 years ago
victorywys / SMART-KPE
Code for paper "Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction"
☆19 · Jan 28, 2021 · Updated 5 years ago
ninghaohello / Interpretation2Adversary
Adversarial learning by utilizing model interpretation
☆10 · Oct 19, 2018 · Updated 7 years ago
yucornetto / GG-Transformer
Code and models for the paper Glance-and-Gaze Vision Transformer
☆28 · Jun 7, 2021 · Updated 5 years ago
alexlouden / python-opencv-notebook
Ready to run Jupyter notebook docker image with Python 3.9, OpenCV 4 and more
☆11 · Feb 12, 2022 · Updated 4 years ago
wenhuchen / GPT2-Logic2Text
The code for Template-GPT-2 Generation Model for Logic2Text Dataset
☆18 · Jun 1, 2020 · Updated 6 years ago
k9k2 / qSGD
SGD and Ordered SGD codes for deep learning, SVM, and logistic regression
☆36 · Aug 13, 2020 · Updated 5 years ago
ermongroup / BiasAndGeneralization
☆28 · Apr 26, 2019 · Updated 7 years ago
BaohaoLiao / frac-cot
[COLM 2026] An efficient 3D sampling method for long-CoT LLM.
☆16 · May 25, 2025 · Updated last year
xinyuliu-jeffrey / EfficientViT_Model_Zoo
This is the model zoo for our CVPR 2023 paper: EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention
☆14 · Mar 13, 2024 · Updated 2 years ago
BIU-NLP / Breaking_NLI
NLI test set with lexical inferences
☆49 · Oct 2, 2018 · Updated 7 years ago
gd-zhang / noisy-quadratic-model
Large-batch Training, Neural Network Optimization
☆10 · Nov 8, 2019 · Updated 6 years ago
michaelfarrell76 / Distributed-SGD
Parallel SGD, done locally and remote
☆14 · May 19, 2016 · Updated 10 years ago
4uiiurz1 / pytorch-lars
PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)
☆20 · May 11, 2019 · Updated 7 years ago
shiwj16 / raa-drl
☆11 · Apr 20, 2021 · Updated 5 years ago
Delikitty / Computer-Vision-16720-CMU
☆14 · Jul 30, 2017 · Updated 8 years ago
apaszke / talks
Slides from various talks I gave
☆18 · Oct 25, 2018 · Updated 7 years ago
deeplearning-wisc / mllmshift-emi
Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"
☆12 · May 27, 2025 · Updated last year
Olivia-fsm / DoGE
Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"
☆21 · Feb 29, 2024 · Updated 2 years ago
HCDM / XRec
Models for explainable recommendation.
☆12 · Jan 19, 2024 · Updated 2 years ago
edwin-de-jong / incremental-sequence-learning
Implementation of the Incremental Sequence Learning algorithms described in the Incremental Sequence Learning article
☆40 · Sep 8, 2017 · Updated 8 years ago
VisionLearningGroup / SND
☆13 · Oct 8, 2021 · Updated 4 years ago
nd7141 / recsystutorial
☆15 · Sep 25, 2020 · Updated 5 years ago
ruiqi-zhong / Meta-tuning
EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections
☆52 · Sep 15, 2021 · Updated 4 years ago