UnstoppableCurry / RWKV-LM-Interpretability-Research
Interpretability analysis of language model outlier and attempts to distill the model
☆13Updated last year
Alternatives and similar repositories for RWKV-LM-Interpretability-Research:
Users that are interested in RWKV-LM-Interpretability-Research are comparing it to the libraries listed below
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated last year
- JAX implementations of RWKV☆19Updated last year
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Updated last year
- Training a reward model for RLHF using RWKV.☆14Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- ☆40Updated last year
- A converter and basic tester for rwkv onnx☆42Updated last year
- tinygrad port of the RWKV large language model.☆44Updated 8 months ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…☆13Updated 11 months ago
- Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven☆13Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆42Updated last year
- 🎮👥 Experience the future of multiplayer gaming with MUDGPT's AI-generated virtual world! 🌟🤖☆41Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 4 months ago
- Course Project for COMP4471 on RWKV☆17Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆16Updated 4 months ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆41Updated last year
- ☆44Updated 7 months ago
- ☆21Updated 2 months ago
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆28Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- RWKV-7: Surpassing GPT☆79Updated 3 months ago
- Training Models Daily☆17Updated last year
- ☆15Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆28Updated this week
- Simple Autogpt with tree of thoughts☆15Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆36Updated last year