andrewkchan / yalm

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
211 · Updated this week

Alternatives and similar repositories for yalm:

Users who are interested in yalm are comparing it to the libraries listed below.