Easy-to-use AdaHessian optimizer (PyTorch)
☆79Nov 12, 2020Updated 5 years ago
Alternatives and similar repositories for ada-hessian
Users that are interested in ada-hessian are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning☆286Feb 27, 2023Updated 3 years ago
- Analyze AdaHessian optimizer on 2D functions.☆13Aug 13, 2021Updated 4 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆218Apr 4, 2021Updated 5 years ago
- ☆43Jan 30, 2024Updated 2 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A framework for implementing equivariant DL☆10May 25, 2021Updated 4 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆120Aug 4, 2021Updated 4 years ago
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆34Oct 11, 2021Updated 4 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- Starting from the 'r.vw' R interface to Vowpal Wabbit☆13Aug 22, 2018Updated 7 years ago
- R as a backend for web apps.☆10Mar 7, 2018Updated 8 years ago
- To be a next-generation DL-based phenotype prediction from genome mutations.☆19May 17, 2021Updated 4 years ago
- R package for Byte Pair Encoding based on YouTokenToMe☆16Sep 5, 2025Updated 8 months ago
- Manipulating External Pointer☆13Sep 27, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python Research Framework☆107Nov 3, 2022Updated 3 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆182Nov 21, 2021Updated 4 years ago
- GPT, but made only out of MLPs☆89May 25, 2021Updated 4 years ago
- Catalyst.Detection☆12Sep 13, 2021Updated 4 years ago
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention☆49Jul 31, 2020Updated 5 years ago
- Enhanced Fork-Based Parallelization for R☆16Jun 23, 2025Updated 10 months ago
- Meta-learning approach for human-interpretable formulas generation☆10Apr 24, 2020Updated 6 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆48Nov 30, 2021Updated 4 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pytorch LSTM implementation powered by Libtorch☆18Dec 26, 2022Updated 3 years ago
- Scalable Computation of Hessian Diagonals☆14Jun 2, 2024Updated last year
- Implementation of Computer Vision Models in JAX (equinox)☆23Apr 27, 2026Updated last week
- Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations☆22Mar 25, 2023Updated 3 years ago
- Authors implementation of LieTransformer: Equivariant Self-Attention for Lie Groups☆36Feb 5, 2021Updated 5 years ago
- Documentation:☆131May 22, 2023Updated 2 years ago
- EAST-inspired Tensorflow-based Text Detector☆11Feb 18, 2021Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆149Oct 1, 2023Updated 2 years ago
- R bindings for the plog C++ logging library☆26Mar 8, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Generic API for dispatch to Pyro backends.☆16Feb 13, 2022Updated 4 years ago
- Experiments in protein folding through language modeling☆10Dec 10, 2021Updated 4 years ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- pyhessian is a TensorFlow module which can be used to estimate Hessian matrices☆25Mar 26, 2021Updated 5 years ago
- imagerExtra is an R package for image processing based on imager.☆12Jan 4, 2023Updated 3 years ago
- R package exposing the rapidjsonr c++ header-only library☆16Nov 23, 2025Updated 5 months ago
- A repository containing the code for the Bistable Recurrent Cell☆47Jan 3, 2021Updated 5 years ago