zhuchen03/gradinit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhuchen03/gradinit)

zhuchen03 / gradinit

Learning to Initialize Neural Networks for Stable and Efficient Training

☆138

Alternatives and similar repositories for gradinit

Users that are interested in gradinit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bonlime / sota_imagenet
View on GitHub
Code for training on Imagenet to SOTA results using PyTorch
☆13Aug 14, 2023Updated 2 years ago
vadimkantorov / convasr
View on GitHub
Baseline convolutional ASR system in PyTorch
☆21Nov 16, 2023Updated 2 years ago
acmi-lab / pretraining-with-nonsense
View on GitHub
Pretraining summarization models using a corpus of nonsense
☆13Sep 28, 2021Updated 4 years ago
bonlime / BAdam
View on GitHub
Adam with minor modifications which give significant improvement
☆19Aug 20, 2021Updated 4 years ago
layer6ai-labs / T-Fixup
View on GitHub
Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"
☆90Feb 1, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JonasGeiping / dataaugs
View on GitHub
☆18Oct 12, 2022Updated 3 years ago
arsenyinfo / hookandlook
View on GitHub
A library helping to gather stats and run checks during training deep learning models with Pytorch
☆35Mar 6, 2022Updated 4 years ago
wronnyhuang / gen-viz
View on GitHub
Code for the paper "Understanding Generalization through Visualizations"
☆63Jan 15, 2021Updated 5 years ago
VITA-Group / Diverse-ViT
View on GitHub
[CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…
☆25Mar 9, 2022Updated 4 years ago
mshukor / ViCHA
View on GitHub
[BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"
☆54Oct 20, 2022Updated 3 years ago
mgrankin / over9000
View on GitHub
Over9000 optimizer
☆424Nov 22, 2022Updated 3 years ago
mil-ad / prospr
View on GitHub
Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients
☆32Mar 30, 2022Updated 4 years ago
zhuchen03 / ConvexPolytopePosioning
View on GitHub
ConvexPolytopePosioning
☆37Jan 10, 2020Updated 6 years ago
RenkunNi / MetaAug
View on GitHub
☆27Sep 13, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zhoudaquan / Refiner_ViT
View on GitHub
☆110Sep 15, 2021Updated 4 years ago
montehoover / DynaGuard
View on GitHub
Code for "DynaGuard: A Dynamic Guardrail Model With User-Defined Policies."
☆23Nov 3, 2025Updated 8 months ago
nlpapereading / nlpapereading
View on GitHub
☆58Sep 23, 2022Updated 3 years ago
VITA-Group / Ultra-Data-Efficient-GAN-Training
View on GitHub
[NeurIPS'21] "Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly", Tianlong Chen, Yu Cheng, Zhe …
☆84Dec 30, 2021Updated 4 years ago
ivanpanshin / hist_cancer
View on GitHub
Histopathologic Cancer Detection model based on Kaggle Challenge https://www.kaggle.com/c/histopathologic-cancer-detection (top 1%)
☆11Feb 16, 2021Updated 5 years ago
taoyang1122 / GradAug
View on GitHub
[NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks
☆93Dec 16, 2020Updated 5 years ago
noiseQA / NoiseQA
View on GitHub
☆12Feb 22, 2021Updated 5 years ago
devnkong / GOAT
View on GitHub
Official implementation of GOAT model (ICML2023)
☆38Jul 3, 2023Updated 3 years ago
zeyademam / active_learning
View on GitHub
Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training wi…
☆54Nov 29, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
solemn-leader / finnet
View on GitHub
Our 1st place solution to finnet challenge
☆10May 29, 2020Updated 6 years ago
alexandonian / contrastive-feature-loss
View on GitHub
PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021)
☆55Nov 19, 2021Updated 4 years ago
hpcgroup / loki
View on GitHub
Algorithms for approximate attention in LLMs
☆22Apr 14, 2025Updated last year
clovaai / AdamP
View on GitHub
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)
☆412Jan 13, 2021Updated 5 years ago
bozheng-hit / VoCapXLM
View on GitHub
Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"
☆20Nov 12, 2021Updated 4 years ago
VITA-Group / SDCLR
View on GitHub
[ICML 2021] “ Self-Damaging Contrastive Learning”, Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang
☆64Dec 30, 2021Updated 4 years ago
thorikawa / akaze-opencv
View on GitHub
wrap AKAZE features implementatino to cv::Feature2D API without rebuilding OpenCV
☆15Oct 16, 2014Updated 11 years ago
ShoufaChen / CycleMLP
View on GitHub
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆290Apr 25, 2022Updated 4 years ago
amkatrutsa / advanced-opt
View on GitHub
Presentations of the advanced topics in optimization
☆11Oct 30, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
VITA-Group / Nasty-Teacher
View on GitHub
[ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…
☆83Dec 30, 2021Updated 4 years ago
defgsus / clipig
View on GitHub
OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines
☆19Jan 4, 2022Updated 4 years ago
YingzhenLi / SteinGrad
View on GitHub
Code release for the ICLR paper
☆22Jun 13, 2018Updated 8 years ago
EsterHlav / Dynamical-Isometry-from-Orthogonality-Neural-Nets
View on GitHub
Mathematical consequences of orthogonal weights initialization and regularization in deep learning. Experiments with gain-adjusted orthog…
☆17Sep 21, 2019Updated 6 years ago
universome / firelab
View on GitHub
Experimental framework for running pytorch experiments
☆14Mar 6, 2023Updated 3 years ago
arpitbansal297 / Meta-Balance
View on GitHub
☆24Jan 27, 2022Updated 4 years ago
princeton-vl / think_visually
View on GitHub
Code for ACL 2018 paper 'Think Visually: Question Answering through Virtual Imagery'
☆13Mar 24, 2023Updated 3 years ago