wleilei / rwkv_demo

A simple and easily understandable version of RWKV

☆13

Related projects

Alternatives and complementary repositories for rwkv_demo

BBuf / RWKV-World-HF-Tokenizer
☆33 Updated 3 months ago
frankxwang / dpo-prefix-sharing
DPO, but faster 🚀
☆21 Updated 2 weeks ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11 Updated 8 months ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆39 Updated 3 months ago
xhan77 / in-context-alignment
In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
☆33 Updated last year
SprocketLab / sparse_matrix_fine_tuning
Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"
☆16 Updated this week
ssbuild / rwkv_finetuning
RWKV fine-tuning
☆35 Updated 6 months ago
TorchRWKV / flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
☆13 Updated this week
yynil / RWKVinLLAMA
☆14 Updated this week
wangrui6 / Zhihu-KOL
☆27 Updated last year
Zyphra / Zyda_processing
☆26 Updated 4 months ago
40740 / Bert-VITS2-2
☆13 Updated 8 months ago
lucasjinreal / ImageTokenizer
imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; it supports both image and video…
☆27 Updated 4 months ago
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆50 Updated 5 months ago
shoumenchougou / Awesome-RWKV-Prompts
Awesome RWKV Prompts: user-friendly, ready-to-use RWKV prompt examples for all users.
☆29 Updated 3 months ago
GreenBitAI / bitorch-engine
A toolkit that enhances PyTorch with specialized functions for low-bit quantized neural networks.
☆28 Updated 4 months ago
umass-ml4ed / mathGPT
A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.
☆33 Updated last year
LAION-AI / riverbed
Tools for content datamining and NLP at scale
☆42 Updated 4 months ago
JL-er / RWKV-PEFT
☆82 Updated this week
KohakuBlueleaf / guanaco-lora
Instruct-tune LLaMA on consumer hardware
☆73 Updated last year
princeton-nlp / ELIZA-Transformer
Representing Rule-based Chatbots with Transformers
☆18 Updated 3 months ago
OpenNLPLab / LASP
Linear Attention Sequence Parallelism (LASP)
☆64 Updated 5 months ago
Jellyfish042 / Sudoku-RWKV
☆21 Updated last week
kaiokendev / cutoff-len-is-context-len
Demonstration that fine-tuning a RoPE model on sequences longer than those used in pre-training extends the model's context limit
☆63 Updated last year
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆55 Updated 2 months ago
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆14 Updated 8 months ago
mrsteyk / RWKV-LM-deepspeed
☆42 Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆33 Updated 8 months ago
RWKV / RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…
☆21 Updated 7 months ago
OpenMOSE / RWKV-LM-RLHF
Reinforcement Learning Toolkit for RWKV. Distillation, SFT, RLHF (DPO, ORPO), infinite context training, aligning. Let's boost the model's int…
☆18 Updated last week