sunyt32 / torchscale
Transformers at any scale
☆41Updated last year
Alternatives and similar repositories for torchscale:
Users that are interested in torchscale are comparing it to the libraries listed below
- Retrieval as Attention☆83Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆29Updated last year
- DEMix Layers for Modular Language Modeling☆53Updated 3 years ago
- ☆96Updated last year
- ☆24Updated 2 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation