SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning
☆23 · Nov 21, 2018 · Updated 7 years ago
Alternatives and similar repositories for smoothout
Users who are interested in smoothout are comparing it to the libraries listed below.
- Conditional Random Fields implemented as Lasagne layer ☆10 · Jul 22, 2016 · Updated 9 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization. ☆14 · Jan 27, 2017 · Updated 9 years ago
- Computing various norms/measures on over-parametrized neural networks ☆50 · Nov 26, 2018 · Updated 7 years ago
- ☆18 · Nov 13, 2019 · Updated 6 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation ☆14 · Sep 2, 2024 · Updated last year
- ☆13 · Feb 17, 2018 · Updated 8 years ago
- Towards cross-lingual distributed representations without parallel text trained with adversarial autoencoders ☆22 · Aug 11, 2016 · Updated 9 years ago
- A simple pytorch implementation of Differentiable Architecture Search (DARTS) ☆22 · Aug 27, 2019 · Updated 6 years ago
- Official code for the paper "PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models". ☆15 · Dec 8, 2022 · Updated 3 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization ☆37 · Mar 8, 2018 · Updated 8 years ago
- AutoGrow: Automatic Layer Growing in Deep Convolutional Networks (KDD 2020) ☆40 · Jun 10, 2019 · Updated 6 years ago
- code to reproduce the empirical results in the research paper ☆38 · Oct 12, 2021 · Updated 4 years ago
- This repo contains the code used for NeurIPS 2019 paper "Asymmetric Valleys: Beyond Sharp and Flat Local Minima". ☆14 · Oct 25, 2019 · Updated 6 years ago
- ☆23 · Sep 4, 2016 · Updated 9 years ago
- Code for paper "Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction" ☆19 · Jan 28, 2021 · Updated 5 years ago
- Code and models for the paper Glance-and-Gaze Vision Transformer ☆28 · Jun 7, 2021 · Updated 4 years ago
- Adversarial learning by utilizing model interpretation ☆10 · Oct 19, 2018 · Updated 7 years ago
- A tutorial for using Hadoop with Python and Hive ☆10 · May 26, 2015 · Updated 10 years ago
- The code for Template-GPT-2 Generation Model for Logic2Text Dataset ☆18 · Jun 1, 2020 · Updated 5 years ago
- Scrapes novels published on the web and converts them to Aozora Bunko format text ☆19 · Feb 9, 2017 · Updated 9 years ago
- ☆11 · Jan 26, 2020 · Updated 6 years ago
- Code and manuscript for "Efficient Per-Example Gradient Computations in Convolutional Neural Networks" ☆29 · Jan 26, 2020 · Updated 6 years ago
- ☆28 · Apr 26, 2019 · Updated 6 years ago
- ONNX Integration Builds ☆21 · May 21, 2018 · Updated 7 years ago
- Parallel SGD, done locally and remote ☆14 · May 19, 2016 · Updated 9 years ago
- SGD and Ordered SGD codes for deep learning, SVM, and logistic regression ☆36 · Aug 13, 2020 · Updated 5 years ago
- Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks ☆18 · Nov 5, 2019 · Updated 6 years ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling) ☆19 · May 11, 2019 · Updated 6 years ago
- ☆14 · Jul 30, 2017 · Updated 8 years ago
- ☆13 · May 11, 2021 · Updated 4 years ago
- ☆11 · Apr 20, 2021 · Updated 4 years ago
- All about acceleration and compression of Deep Neural Networks ☆33 · Nov 5, 2019 · Updated 6 years ago
- Code for the paper "Complete Dictionary Learning via \ell_p-norm Maximization", Yifei Shen∗, Ye Xue∗, Jun Zhang, Khaled B. … ☆11 · Nov 21, 2020 · Updated 5 years ago
- Models for explainable recommendation. ☆12 · Jan 19, 2024 · Updated 2 years ago
- ☆13 · Oct 8, 2021 · Updated 4 years ago
- Lua implementation of Entropy-SGD ☆81 · Apr 9, 2018 · Updated 7 years ago
- SurvivalQuilts: Temporal Quilting for Survival Analysis ☆10 · Jan 9, 2024 · Updated 2 years ago
- Implementation of the Incremental Sequence Learning algorithms described in the Incremental Sequence Learning article ☆40 · Sep 8, 2017 · Updated 8 years ago
- Slides from various talks I gave ☆18 · Oct 25, 2018 · Updated 7 years ago