pabloiyu / mini-language-model

Implementation of a Mamba SSM in a mini language model, trained on the public-domain Sherlock Holmes works. Also includes an implementation of parallel adapters in a transformer, and code to run a quantized version of Mistral-7B.
9 · Updated 10 months ago

Alternatives and similar repositories for mini-language-model:

Users who are interested in mini-language-model are comparing it to the libraries listed below.