OpenMOSE / RWKV-Infer

A large-scale RWKV v6, v7 inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy on docker. Supports true multi-batch generation and dynamic State switching. CUDA and Rocm Supported :)
25Updated last week

Alternatives and similar repositories for RWKV-Infer:

Users that are interested in RWKV-Infer are comparing it to the libraries listed below