microsoft / deep-language-networksView on GitHub
We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN
95Jul 25, 2024Updated last year

Alternatives and similar repositories for deep-language-networks

Users that are interested in deep-language-networks are comparing it to the libraries listed below

Sorting:

Are these results useful?