PygmalionAI / data-toolboxLinks
Our data munging code.
☆34Updated last month
Alternatives and similar repositories for data-toolbox
Users that are interested in data-toolbox are comparing it to the libraries listed below
Sorting:
- Conversational Language model toolkit for training against human preferences.☆42Updated last year
- ☆26Updated 2 years ago
- Image Diffusion block merging technique applied to transformers based Language Models.☆55Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Updated last year
- Instruct-tune LLaMA on consumer hardware☆72Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system.☆32Updated 2 years ago
- ChatGPT-like Web UI for RWKVstic☆100Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated 2 years ago
- Finetune any model on HF in less than 30 seconds☆55Updated last month
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- This project is established for real-time training of the RWKV model.☆49Updated last year
- Train Llama Loras Easily☆30Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware☆65Updated 2 years ago
- Merge LLM that are split in to parts☆27Updated 3 months ago
- rwkv_chatbot☆61Updated 2 years ago
- ☆73Updated 2 years ago
- Gradio UI for RWKV LLM☆28Updated 2 years ago
- ☆51Updated last year
- ☆34Updated last year
- GPT-2 small trained on phi-like data☆67Updated last year
- Merge Transformers language models by use of gradient parameters.☆208Updated last year
- Framework agnostic python runtime for RWKV models☆146Updated 2 years ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆157Updated last year
- 4 bits quantization of LLaMa using GPTQ☆130Updated 2 years ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆143Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆62Updated 2 years ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆42Updated last year
- BlinkDL's RWKV-v4 running in the browser☆47Updated 2 years ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆41Updated 2 years ago