nlpodyssey / rwkvLinks

RWKV (Receptance Weighted Key Value) is a RNN with Transformer-level performance

☆41

Alternatives and similar repositories for rwkv

Users that are interested in rwkv are comparing it to the libraries listed below

Sorting:

BlinkDL / WorldModel
Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…
☆39Updated 2 years ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
☆147Updated last year
BlinkDL / nanoRWKV
RWKV in nanoGPT style
☆193Updated last year
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆55Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆145Updated 2 years ago
PABannier / biogpt.cpp
Port of Microsoft's BioGPT in C/C++ using ggml
☆85Updated last year
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14Updated 2 years ago
LAION-AI / blade2blade
Adversarial Training and SFT for Bot Safety Models
☆40Updated 2 years ago
sambanova / generative_data_prep
☆64Updated 5 months ago
NolanoOrg / llama-int4-quant
☆26Updated 2 years ago
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56Updated last year
RWKV / rwkv-onnx
A converter and basic tester for rwkv onnx
☆42Updated last year
RWKV / RWKV-wiki
RWKV centralised docs for the community
☆29Updated 2 months ago
BlinkDL / LM-Trick-Questions
Here we collect trick questions and failed tasks for open source LLMs to improve them.
☆31Updated 2 years ago
Zyphra / Zyda_processing
☆39Updated last year
zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆109Updated 2 years ago
togethercomputer / OpenDataHub
☆127Updated 2 years ago
BlinkDL / Albatross
Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.
☆46Updated last week
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆115Updated 2 years ago
Durham / RWKV-finetune-script
Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset
☆31Updated 2 years ago
dust-tt / llama-ssp
Experiments on speculative sampling with Llama models
☆126Updated 2 years ago
Abel2076 / json2binidx_tool
☆81Updated last year
SeanNaren / min-LLM
Minimal code to train a Large Language Model (LLM).
☆172Updated 3 years ago
kroggen / mamba.c
Inference of Mamba models in pure C
☆191Updated last year
Agora-Lab-AI / Orca
An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"
☆42Updated last year
BlinkDL / fast.c
Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.
☆73Updated 8 months ago
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆39Updated 11 months ago
rahuldshetty / starcoder.js
Web browser version of StarCoder.cpp
☆44Updated 2 years ago
Jellyfish042 / uncheatable_eval
Evaluating LLMs with Dynamic Data
☆96Updated 2 months ago