DonRL10 / RetNet

an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf
12Updated last year

Related projects: