omry / hydra-article-code
☆18 · Updated 5 years ago
Alternatives and similar repositories for hydra-article-code
Users who are interested in hydra-article-code are comparing it to the repositories listed below.
- [EMNLP'19] Summary for Transformer Understanding ☆53 · Updated 5 years ago
- Efficient Neural Interaction Functions Search for Collaborative Filtering ☆18 · Updated 5 years ago
- A PyTorch implementation of Adafactor (https://arxiv.org/pdf/1804.04235.pdf) ☆23 · Updated 5 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch ☆37 · Updated 3 years ago
- Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020 ☆20 · Updated 5 years ago
- ☆95 · Updated 2 years ago
- code for the ddp tutorial ☆32 · Updated 3 years ago
- Implementation of Mogrifier LSTM in PyTorch ☆35 · Updated 5 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch ☆45 · Updated 4 years ago
- Repository for Multimodal AutoML Benchmark ☆66 · Updated 3 years ago
- Large Scale BERT Distillation ☆32 · Updated 2 years ago
- ☆37 · Updated 2 years ago
- Large dataset storage format for Pytorch ☆45 · Updated 3 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845 ☆120 · Updated 3 years ago
- Unofficial PyTorch implementation of Fastformer, based on the paper "Fastformer: Additive Attention Can Be All You Need" ☆135 · Updated 3 years ago
- ☆51 · Updated 4 years ago
- MTAdam: Automatic Balancing of Multiple Training Loss Terms ☆36 · Updated 4 years ago
- [ICLR 2021] "UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems" by Jiayi Shen, Haotao Wang*, Shupeng Gui… ☆39 · Updated 3 years ago
- Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks" ☆31 · Updated 4 years ago
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020) ☆29 · Updated 4 years ago
- Implementation of Multistream Transformers in Pytorch ☆53 · Updated 3 years ago
- Feature Interaction Interpretability via Interaction Detection ☆34 · Updated last year
- The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Natu… ☆48 · Updated 3 years ago
- ☆22 · Updated 2 years ago
- A small framework that mimics PyTorch using CuPy or NumPy ☆27 · Updated 3 years ago
- Official cleanlab repo is at https://github.com/cleanlab/cleanlab ☆57 · Updated 2 years ago
- Stochastic Weight Averaging Tutorials using pytorch. ☆33 · Updated 4 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation" ☆10 · Updated last year
- ☆24 · Updated 2 years ago
- Awesome papers in few-shot learning/one-shot learning. ☆30 · Updated 6 years ago