OpenMOSE / RWKV-Infer

A large-scale RWKV v6 inference with FLA . Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy on docker. Supports true multi-batch generation and dynamic State switching. CUDA and Rocm Supported :)
16Updated last week

Related projects

Alternatives and complementary repositories for RWKV-Infer