Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural network models (and their initializations) to make them easier to train.
☆79Jul 1, 2025Updated 10 months ago
Alternatives and similar repositories for dks
Users that are interested in dks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch optimizers implementing Hilbert Constrained Gradient Descent☆19May 9, 2019Updated 7 years ago
- Regularization, Neural Network Training Dynamics☆14Jan 13, 2020Updated 6 years ago
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Feb 14, 2020Updated 6 years ago
- Second Order Optimization and Curvature Estimation with K-FAC in JAX.☆324May 11, 2026Updated 2 weeks ago
- ☆13Jun 18, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A simple Jax implementation of influence functions.☆20Apr 9, 2024Updated 2 years ago
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- The ECMWF wave model ecWAM☆18May 18, 2026Updated last week
- ☆21Mar 3, 2025Updated last year
- ☆12Dec 7, 2017Updated 8 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- Annealed Importance Sampling (AIS) for generative models.☆16Jul 20, 2018Updated 7 years ago
- Computing gradients and Hessians of feed-forward networks with GPU acceleration☆20Feb 14, 2024Updated 2 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆150Oct 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Minimal Implimentation of VCRec (2024) for collapse provention.☆18Jan 28, 2025Updated last year
- JMP is a Mixed Precision library for JAX.☆213Jan 30, 2025Updated last year
- A lightweight library for tensorflow 2.0☆65Dec 3, 2019Updated 6 years ago
- Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH☆105Feb 18, 2020Updated 6 years ago
- ☆166Dec 13, 2023Updated 2 years ago
- ☆33Jul 8, 2024Updated last year
- ☆34Sep 10, 2024Updated last year
- Variational Autoencoders & Normalizing Flows Project☆18Dec 16, 2016Updated 9 years ago
- simple JAX-/NumPy-based implementations of NGD with exact/approximate Fisher Information Matrix both in parameter-space and function-spac…☆16Oct 21, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆14Apr 2, 2023Updated 3 years ago
- ☆19May 16, 2026Updated last week
- [EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…☆14Oct 17, 2023Updated 2 years ago
- Implementation for the PHM paper at ICLR'21☆13Mar 1, 2023Updated 3 years ago
- [ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent☆20Dec 4, 2023Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization☆37Mar 8, 2018Updated 8 years ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆18Sep 15, 2023Updated 2 years ago
- This repository is no longer maintained. Check☆81Apr 23, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆24Feb 3, 2019Updated 7 years ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆18Jan 12, 2026Updated 4 months ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- Distilling Model Failures as Directions in Latent Space☆48Feb 8, 2023Updated 3 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆220Feb 13, 2023Updated 3 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos"☆74Jun 9, 2016Updated 9 years ago