LIJUNYI95 / SuperAdam
Official PyTorch implementation of the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'
☆17 · Jan 12, 2022 · Updated 4 years ago
Alternatives and similar repositories for SuperAdam
Users who are interested in SuperAdam are comparing it to the repositories listed below
- awesome unsupervised learning paper list ☆12 · Jan 4, 2018 · Updated 8 years ago
- ☆12 · Dec 11, 2020 · Updated 5 years ago
- A presentation on Augmented CycleGAN and the papers that lead up to it ☆11 · Dec 3, 2018 · Updated 7 years ago
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks". ☆16 · Dec 7, 2021 · Updated 4 years ago
- Our implementation of Shampoo optimizer based on https://arxiv.org/pdf/1802.09568.pdf ☆12 · Dec 23, 2019 · Updated 6 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021) ☆18 · May 10, 2023 · Updated 2 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency ☆17 · Dec 2, 2021 · Updated 4 years ago
- [ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data. ☆23 · Jun 8, 2023 · Updated 2 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches ☆20 · Mar 31, 2023 · Updated 2 years ago
- Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020 ☆20 · Feb 14, 2020 · Updated 6 years ago
- ☆19 · Jan 27, 2021 · Updated 5 years ago
- A CLIP conditioned Decision Transformer. ☆22 · Jul 14, 2021 · Updated 4 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability ☆21 · Jul 16, 2022 · Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆49 · Nov 30, 2021 · Updated 4 years ago
- ☆21 · Mar 15, 2023 · Updated 2 years ago
- Code of our NeurIPS 2020 paper "Auto Learning Attention", coming soon ☆22 · Apr 14, 2021 · Updated 4 years ago
- ☆26 · Sep 15, 2022 · Updated 3 years ago
- A collection of CVPR papers, including abstract, motivation, architecture, results, and summary ☆27 · Dec 15, 2018 · Updated 7 years ago
- ☆64 · Nov 4, 2021 · Updated 4 years ago
- Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein Localization Prediction ☆29 · Oct 1, 2023 · Updated 2 years ago
- Notes on the characteristics of each commonly used deep model architecture (figures and code) ☆30 · Dec 17, 2018 · Updated 7 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild ☆33 · Apr 16, 2024 · Updated last year
- Filter Response Normalization Layer in PyTorch ☆121 · Feb 3, 2020 · Updated 6 years ago
- Implements the SM3-II adaptive optimization algorithm for PyTorch. ☆33 · Sep 3, 2024 · Updated last year
- A TensorFlow implementation of NRTR, a No-Recurrence Seq2Seq Model for Scene Text Recognition ☆31 · Sep 1, 2019 · Updated 6 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention". ☆44 · Oct 29, 2021 · Updated 4 years ago
- Compressing Representations for Self-Supervised Learning ☆80 · Feb 18, 2021 · Updated 4 years ago
- Hands-on tutorial for Meituan YOLOv6 model training and end-to-end TensorRT deployment ☆34 · Jun 30, 2022 · Updated 3 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?" ☆36 · Dec 17, 2021 · Updated 4 years ago
- Building a CNN with NumPy to review deep learning fundamentals ☆35 · Sep 11, 2018 · Updated 7 years ago
- Octave convolution ☆34 · Jan 22, 2022 · Updated 4 years ago
- Another attempt at a long-context / efficient transformer by me ☆38 · Apr 11, 2022 · Updated 3 years ago
- Disable the Force Stop & Uninstall button in Manage Application using Device Administration in Android. ☆11 · Feb 12, 2014 · Updated 12 years ago
- ☆13 · Jul 20, 2023 · Updated 2 years ago
- ☆11 · Feb 28, 2022 · Updated 3 years ago
- SKFAC Preconditioner for MindSpore ☆12 · Jul 2, 2021 · Updated 4 years ago
- Test-Time Label-Shift Adaptation ☆13 · May 24, 2023 · Updated 2 years ago
- ☆11 · Apr 8, 2024 · Updated last year
- Topic modelling and co-occurrence analysis of the bio-economy ☆10 · Jul 17, 2017 · Updated 8 years ago