recursal / minmodmon
Mini Model Daemon
☆11 Updated 5 months ago
Alternatives and similar repositories for minmodmon:
Users who are interested in minmodmon are comparing it to the repositories listed below.
- https://x.com/BlinkDL_AI/status/1884768989743882276 ☆27 Updated 2 months ago
- RWKV, in easy to read code ☆72 Updated last month
- Course Project for COMP4471 on RWKV ☆17 Updated last year
- Inference RWKV v7 in pure C. ☆31 Updated 3 weeks ago
- RWKV-7: Surpassing GPT ☆83 Updated 5 months ago
- GoldFinch and other hybrid transformer components ☆10 Updated 3 weeks ago
- A fast RWKV Tokenizer written in Rust ☆44 Updated 3 weeks ago
- A large-scale RWKV v6, v7 (World, ARWKV, PRWKV) inference. Capable of inference by combining multiple states (Pseudo MoE). Easy to deploy o… ☆35 Updated this week
- tinygrad port of the RWKV large language model. ☆44 Updated last month
- Some preliminary explorations of Mamba's context scaling. ☆11 Updated 4 months ago
- JAX implementations of RWKV ☆19 Updated last year
- Inference RWKV with multiple supported backends. ☆40 Updated this week
- Fast modular code to create and train cutting edge LLMs ☆66 Updated 11 months ago
- RWKV centralised docs for the community ☆22 Updated 3 weeks ago
- Prepare for DeepSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code. ☆71 Updated 2 months ago
- Experiments with BitNet inference on CPU ☆53 Updated last year
- RWKV models and examples powered by candle. ☆18 Updated last month
- ☆49 Updated last year
- ☆34 Updated last month
- new optimizer ☆19 Updated 8 months ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation. ☆71 Updated 2 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆31 Updated 8 months ago
- GoldFinch and other hybrid transformer components ☆45 Updated 9 months ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs ☆111 Updated last year
- ☆13 Updated 10 months ago
- RWKV in nanoGPT style ☆189 Updated 10 months ago
- ☆40 Updated 2 years ago
- A specialized RWKV-7 model for Othello (a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It… ☆39 Updated 3 months ago
- ☆22 Updated 3 months ago
- ☆18 Updated 3 months ago