pabloiyu / mini-language-modelLinks

Implementing Mamba SSM into a mini language model and training it on the open domain works of Sherlock Holmes. Also, implementation of parallel adapters into a transformer. Finally, code to run a quantized version of Mistral-7B.
9Updated last year

Alternatives and similar repositories for mini-language-model

Users that are interested in mini-language-model are comparing it to the libraries listed below

Sorting: