lucidrains / PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
☆819Updated 2 years ago
Alternatives and similar repositories for PaLM-pytorch:
Users that are interested in PaLM-pytorch are comparing it to the libraries listed below
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆859Updated last year
- Cramming the training of a (BERT-type) language model into limited compute.☆1,325Updated 9 months ago
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆631Updated last year
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.