A PyTorch implementation of Adafactor (https://arxiv.org/pdf/1804.04235.pdf)
☆26Aug 27, 2019Updated 6 years ago
Alternatives and similar repositories for adafactor-pytorch
Users who are interested in adafactor-pytorch are comparing it to the libraries listed below.
- Code for Episodic Memory Reader (EMR) https://arxiv.org/abs/1903.06164 ☆15Nov 16, 2022Updated 3 years ago
- Official Code Repository for the paper "Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation …☆20Jun 19, 2023Updated 2 years ago
- ☆15Mar 2, 2025Updated last year
- Sancho McCann's PhD Thesis Research Code☆25Oct 12, 2017Updated 8 years ago
- utilities for tensorflow2.x.x☆15Jul 19, 2023Updated 2 years ago
- ☆13Jan 15, 2025Updated last year
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆17Dec 15, 2021Updated 4 years ago
- ☆13Dec 13, 2024Updated last year
- ☆23Oct 20, 2020Updated 5 years ago
- MultiLabel classification of cow diseases by text and symptoms recognition (NER)☆12Aug 13, 2022Updated 3 years ago
- A tool to help adjust or zero-out Flux Block Weights and SAVE. I'm not a dev, so this implementation might be wrong.☆29Nov 20, 2024Updated last year
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021)☆17Jun 4, 2021Updated 4 years ago
- Codebase for adaptive continual memory☆14Aug 15, 2023Updated 2 years ago
- Stereoscopic 3D toolkit for ComfyUI combining depth-based stereo generation with GPU acceleration, native VR viewing via PyOpenXR, and AI…☆41Feb 26, 2026Updated 3 weeks ago
- Variance Covariance Regularization☆14Jun 22, 2023Updated 2 years ago
- A PyTorch implementation of BERT for seq2seq tasks, using the UniLM approach.☆10Apr 1, 2020Updated 5 years ago
- The Stream-51 dataset for streaming classification and novelty detection from videos.☆15Feb 22, 2022Updated 4 years ago
- ☆18May 5, 2023Updated 2 years ago
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆16Jul 27, 2023Updated 2 years ago
- Code for "Self-Distillation as Instance-Specific Label Smoothing"☆16Oct 22, 2020Updated 5 years ago
- Unofficial PyTorch Implementation of StarGAN-ZSVC☆14Aug 5, 2021Updated 4 years ago
- Creates latent spaces filled with Perlin-based noise that can actually be used by the samplers.☆34Aug 13, 2024Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification (WWW'22)☆32Jun 21, 2025Updated 9 months ago
- A Keras-style PyTorch framework☆11Dec 10, 2019Updated 6 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆71Sep 25, 2024Updated last year
- ☆10Oct 8, 2018Updated 7 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- [NeurIPS 2020] Official Implementation: "SMYRF: Efficient Attention using Asymmetric Clustering".☆50Sep 6, 2023Updated 2 years ago
- ☆21Jun 24, 2025Updated 9 months ago
- ☆15Apr 18, 2022Updated 3 years ago
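- Chinese marketing text generation, implemented with a Pointer Generator Network and Coverage, plus experiments with various text data augmentation and optimization techniques☆18Sep 5, 2020Updated 5 years ago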
- A place to house minutes and other documents related to the core team.☆13Dec 16, 2020Updated 5 years ago
- torch7 wrapper for knn CUDA code☆10Dec 1, 2014Updated 11 years ago
- ☆12Oct 5, 2020Updated 5 years ago
- [NeurIPS 2024] Physics-Informed Regularization for Domain-Agnostic Dynamical System Modeling☆26Jul 10, 2025Updated 8 months ago
- ☆10Sep 16, 2020Updated 5 years ago
- ☆22Mar 16, 2024Updated 2 years ago
- Python package for calculating Mahalanobis distances from NumPy arrays☆15Jun 22, 2022Updated 3 years ago