josephrocca / rwkv-v4-web
BlinkDL's RWKV-v4 running in the browser
☆47 Updated 2 years ago
Alternatives and similar repositories for rwkv-v4-web
Users who are interested in rwkv-v4-web are comparing it to the libraries listed below.
- ☆13 Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms ☆14 Updated 2 years ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this… ☆21 Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files. ☆51 Updated last year
- Conversational Language model toolkit for training against human preferences. ☆42 Updated last year
- A converter and basic tester for rwkv onnx ☆43 Updated last year
- Framework agnostic python runtime for RWKV models ☆147 Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models. ☆108 Updated last year
- Experimental sampler to make LLMs more creative ☆31 Updated 2 years ago
- Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven ☆13 Updated 2 years ago
- ChatGPT-like Web UI for RWKVstic ☆100 Updated 2 years ago
- Controllable Language Model Interactions in TypeScript ☆10 Updated last year
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /… ☆40 Updated 2 years ago
- ☆27 Updated 2 years ago
- Image Diffusion block merging technique applied to transformers based Language Models. ☆56 Updated 2 years ago
- Training a reward model for RLHF using RWKV. ☆15 Updated 2 years ago
- This project is established for real-time training of the RWKV model. ☆50 Updated last year
- Gradio UI for RWKV LLM ☆29 Updated 2 years ago
- Embeddings focused small version of Llama NLP model ☆107 Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation. ☆71 Updated 2 years ago
- ☆40 Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated 2 years ago
- GPT-2 small trained on phi-like data ☆67 Updated last year
- A Qt GUI for large language models ☆45 Updated 2 years ago
- Course Project for COMP4471 on RWKV ☆17 Updated last year
- Where we keep our notes about model training runs. ☆16 Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ ☆51 Updated 2 years ago
- Easily convert HuggingFace models to GGUF-format for llama.cpp ☆23 Updated last year
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining. ☆47 Updated last month
- GGML implementation of BERT model with Python bindings and quantization. ☆58 Updated last year