A PyTorch implementation of Adafactor (https://arxiv.org/pdf/1804.04235.pdf)
☆26Aug 27, 2019Updated 6 years ago
Alternatives and similar repositories for adafactor-pytorch
Users who are interested in adafactor-pytorch are comparing it to the libraries listed below.
- Apply surface laplacian transform to mne Epochs objects☆17Aug 3, 2023Updated 2 years ago
- ☆15Mar 2, 2025Updated last year
- ☆13Dec 13, 2024Updated last year
- PyTorch implementation of the SIESTA algorithm from our TMLR-2023 paper "SIESTA: Efficient Online Continual Learning with Sleep"☆13Oct 25, 2024Updated last year
- Text generation: automatically generate marketing copy from product parameters and images☆12Sep 17, 2021Updated 4 years ago
- ☆23Oct 20, 2020Updated 5 years ago
- A tool to help adjust or zero-out Flux Block Weights and SAVE. I'm not a dev, so this implementation might be wrong.☆29Nov 20, 2024Updated last year
- Variance Covariance Regularization☆14Jun 22, 2023Updated 2 years ago
- Stable Diffusion PNGINFO Beautify extension☆31Oct 9, 2025Updated 6 months ago
- PyTorch implementation of BERT for seq2seq tasks, using the UniLM approach.☆10Apr 1, 2020Updated 6 years ago
- The Stream-51 dataset for streaming classification and novelty detection from videos.☆15Feb 22, 2022Updated 4 years ago
- A Continual Learning Library in PyTorch and JAX☆13Apr 18, 2023Updated 3 years ago
- Code for "Self-Distillation as Instance-Specific Label Smoothing"☆15Oct 22, 2020Updated 5 years ago
- I modified some code of K-BERT so that it can be fit to English datasets☆11Dec 15, 2022Updated 3 years ago
- This allows to create latent spaces filled with perlin-based noise that can actually be used by the samplers.☆34Aug 13, 2024Updated last year
- Train SSD with PyTorch, with many changes compared to the original pytorch-ssd☆11Jul 4, 2022Updated 3 years ago
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model☆13Sep 25, 2024Updated last year
- Implementing DropPath/StochasticDepth in PyTorch☆18Feb 5, 2022Updated 4 years ago
- Towards Sustainable Learning: Coresets for Data-efficient Deep Learning☆13Jul 5, 2023Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆72Sep 25, 2024Updated last year
- ☆10Oct 8, 2018Updated 7 years ago
- A simple modification on the official DETR codebase with support to Finetune on custom dataset☆14Nov 26, 2020Updated 5 years ago
- CoreXY conversion for the Folgertech FT-5 printer☆15Feb 20, 2024Updated 2 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- [NeurIPS 2020] Official Implementation: "SMYRF: Efficient Attention using Asymmetric Clustering".☆50Sep 6, 2023Updated 2 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- A hybrid LaTeX code + rich-text editor. It seamlessly syncs a rich-text What You See Is What You Get (WYSIWYG) view with raw LaTeX source…☆33Mar 26, 2026Updated last month
- dracut module using vdfuse to loop mount☆11Mar 21, 2021Updated 5 years ago
- PyTorch implementation of the ExStream method from our ICRA-2019 paper "Memory Efficient Experience Replay for Streaming Learning"☆22Nov 26, 2019Updated 6 years ago
- ☆11Oct 5, 2020Updated 5 years ago
- ☆10Sep 16, 2020Updated 5 years ago
- ☆17Jan 19, 2026Updated 3 months ago
- An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- Data Layers for Caffe☆13Apr 15, 2016Updated 10 years ago
- ☆22Mar 16, 2024Updated 2 years ago
- A c++ implementation for calculating the accuracy metrics (Accuracy, Error Rate, Precision(micro/macro), Recall(micro/macro), Fscore(micr…☆12Jul 2, 2019Updated 6 years ago
- PyTorch helper code☆10Dec 20, 2018Updated 7 years ago
- Create a Hopfield Network for Image Reconstruction☆18May 9, 2020Updated 5 years ago
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago