cwhy / rwkv-deconLinks

Trying to deconstruct RWKV in understandable terms

☆14

Alternatives and similar repositories for rwkv-decon

Users that are interested in rwkv-decon are comparing it to the libraries listed below

Sorting:

karim-aloulou / Espitchatbot-RASA-RAVEN
Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven
☆13Updated 2 years ago
tensorpro / tpu_rwkv
JAX implementations of RWKV
☆19Updated 2 years ago
lachlansneff / sparsellama
☆40Updated 2 years ago
josephrocca / rwkv-v4-web
BlinkDL's RWKV-v4 running in the browser
☆47Updated 2 years ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated 2 years ago
xaedes / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆22Updated 2 years ago
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆58Updated last year
Hellisotherpeople / llm_steer-oobabooga
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆43Updated last year
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
Zyphra / Zyda_processing
☆39Updated last year
ZeldaHuang / rwkv-cpp-server
Easily deploy your rwkv model
☆19Updated 2 years ago
jquesnelle / ctranslate2-rs
Rust bindings for CTranslate2
☆14Updated 2 years ago
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56Updated last year
silphendio / sliced_llama
Simple LLM inference server
☆20Updated last year
BlinkDL / WorldModel
Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…
☆40Updated 2 years ago
RWKV / rwkv-onnx
A converter and basic tester for rwkv onnx
☆43Updated last year
kyegomez / Falcon
A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…
☆12Updated last year
zarakiquemparte / zaraki-tools
☆27Updated 2 years ago
catid / bitnet_cpu
Experiments with BitNet inference on CPU
☆54Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated last year
iantbutler01 / ditty
A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.
☆16Updated last year
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆29Updated last week
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆35Updated last year
jiamingkong / rwkv_reward
Training a reward model for RLHF using RWKV.
☆15Updated 2 years ago
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆26Updated last year
flawedmatrix / mamba-ssm
Implementation of mamba with rust
☆88Updated last year
harubaru / convogpt
Conversational Language model toolkit for training against human preferences.
☆42Updated last year
SonicCodes / subcloning
implementation of https://arxiv.org/pdf/2312.09299
☆21Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year