hizkifw / WebChatRWKVstic
ChatGPT-like Web UI for RWKVstic
☆100Updated last year
Related projects ⓘ
Alternatives and complementary repositories for WebChatRWKVstic
- Framework agnostic python runtime for RWKV models☆145Updated last year
- Gradio UI for RWKV LLM☆28Updated last year
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆70Updated last year
- Enhancing LangChain prompts to work better with RWKV models☆34Updated last year
- BlinkDL's RWKV-v4 running in the browser☆47Updated last year
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆307Updated 9 months ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆406Updated last year
- rwkv_chatbot☆62Updated last year
- 📖 — Notebooks related to RWKV☆59Updated last year
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆50Updated last year
- Instruct-tune LLaMA on consumer hardware☆73Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated last year
- 4 bits quantization of SantaCoder using GPTQ☆53Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆99Updated 6 months ago
- A converter and basic tester for rwkv onnx☆41Updated 9 months ago
- 4 bits quantization of LLaMa using GPTQ☆130Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- This project is established for real-time training of the RWKV model.☆50Updated 5 months ago
- Efficient 3bit/4bit quantization of LLaMA models☆19Updated last year
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆63Updated last year
- 8-bit CUDA functions for PyTorch☆45Updated last year
- SoTA Transformers with C-backend for fast inference on your CPU.☆312Updated 11 months ago
- ☆81Updated 5 months ago
- tinygrad port of the RWKV large language model.☆43Updated 4 months ago
- Conversational Language model toolkit for training against human preferences.☆40Updated 7 months ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆41Updated last year
- A QQ Chatbot based on RWKV (W.I.P.)☆78Updated 11 months ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- Collection of various text datasets to assist ML researchers in training or fine-tuning their models☆20Updated last year
- Inference code for facebook LLaMA models with Wrapyfi support☆130Updated last year