PygmalionAI / logbooksLinks
Where we keep our notes about model training runs.
☆16Updated 2 years ago
Alternatives and similar repositories for logbooks
Users that are interested in logbooks are comparing it to the libraries listed below
Sorting:
- Conversational Language model toolkit for training against human preferences.☆41Updated last year
- 4 bits quantization of SantaCoder using GPTQ☆51Updated 2 years ago
- BlinkDL's RWKV-v4 running in the browser☆47Updated 2 years ago
- Turns KoboldAI into a crowdsourced distributed cluster☆32Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆106Updated last year
- 4 bits quantization of LLMs using GPTQ☆49Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated 2 years ago
- ☆27Updated last year
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆28Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- Framework agnostic python runtime for RWKV models☆146Updated 2 years ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated 2 years ago
- Our data munging code.☆34Updated 10 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- The official front-end UI.☆40Updated 2 years ago
- A collection of prompts for Llama☆100Updated 2 years ago
- The official service back-end.☆13Updated 2 years ago
- rwkv_chatbot☆62Updated 2 years ago
- ChatGPT-like Web UI for RWKVstic☆100Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆207Updated last year
- Tools with GUI for GPT finetune data preparation☆22Updated 4 years ago
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- GPT-2 small trained on phi-like data☆67Updated last year
- An extension to Oobabooga to add a simple memory function for chat☆25Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated 2 years ago
- Text WebUI extension to add clever Notebooks to Chat mode☆142Updated 3 weeks ago
- Train Llama Loras Easily☆31Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated 2 years ago
- The code we currently use to fine-tune models.☆115Updated last year