AlekseyKorshuk / role-play-synthetic
Synthetic Role-Play Conversation Dataset Generation
☆40Updated last year
Alternatives and similar repositories for role-play-synthetic:
Users who are interested in role-play-synthetic are comparing it to the libraries listed below.
- An unsupervised model merging algorithm for Transformers-based language models.☆101Updated 8 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆115Updated last year
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆149Updated 11 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 9 months ago
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ☆99Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆165Updated 8 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- ☆107Updated 4 months ago
- A pipeline parallel training script for LLMs.☆116Updated this week
- ☆17Updated 8 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆198Updated 2 months ago
- ☆52Updated 7 months ago
- FuseAI Project☆76Updated last month
- Let's create synthetic textbooks together :)☆73Updated 11 months ago
- ☆109Updated 5 months ago
- Unofficial implementation of AlpaGasus☆90Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- A self-alignment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…☆180Updated 7 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- ☆51Updated 5 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆163Updated last year
- Merge Transformers language models by use of gradient parameters.☆202Updated 5 months ago
- A benchmark for emotional intelligence in large language models☆212Updated 5 months ago
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆119Updated last week
- Home of FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information (ACL 2023)☆45Updated 3 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆139Updated 3 months ago
- entropix style sampling + GUI☆25Updated 2 months ago