recursal / minmodmon
Mini Model Daemon
☆11Updated 4 months ago
Alternatives and similar repositories for minmodmon:
Users that are interested in minmodmon are comparing it to the libraries listed below
- https://x.com/BlinkDL_AI/status/1884768989743882276☆27Updated last month
- Course Project for COMP4471 on RWKV☆17Updated last year
- GoldFinch and other hybrid transformer components☆10Updated 2 weeks ago
- A large-scale RWKV v6, v7(World, ARWKV, PRWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy o…☆33Updated last week
- tinygrad port of the RWKV large language model.☆44Updated 3 weeks ago
- Inference RWKV with multiple supported backends.☆39Updated this week
- A fast RWKV Tokenizer written in Rust☆44Updated last week
- RWKV, in easy to read code☆71Updated last week
- RWKV-7: Surpassing GPT☆82Updated 4 months ago
- Fast modular code to create and train cutting edge LLMs☆66Updated 10 months ago
- Inference RWKV v7 in pure C.☆13Updated this week
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Updated last year
- Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…☆35Updated last week
- ☆9Updated 10 months ago
- The training notebooks that were similar to the original script used to train TinyMistral.☆21Updated last year
- ☆32Updated last week
- Some preliminary explorations of Mamba's context scaling.☆11Updated 3 months ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated 2 years ago
- ☆18Updated 3 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆70Updated last month
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- ☆12Updated 3 months ago
- ☆22Updated 3 months ago
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Updated last year
- Experiments with BitNet inference on CPU☆53Updated last year
- RWKV centralised docs for the community☆21Updated this week
- BlinkDL's RWKV-v4 running in the browser☆47Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 10 months ago
- Inference of Mamba models in pure C☆187Updated last year
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Updated last year