AlekseyKorshuk / role-play-syntheticLinks
Synthetic Role-Play Conversation Dataset Generation
☆43Updated last year
Alternatives and similar repositories for role-play-synthetic
Users that are interested in role-play-synthetic are comparing it to the libraries listed below
Sorting:
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆155Updated last year
- A benchmark for role-playing language models☆99Updated last month
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year
- Merge Transformers language models by use of gradient parameters.☆206Updated 10 months ago
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆127Updated 5 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆129Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆91Updated last year
- ☆53Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆131Updated last year
- Unofficial implementation of AlpaGasus☆91Updated last year
- A self-ailgnment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…☆196Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- manage histories of LLM applied applications☆90Updated last year
- A simple converter which converts pytorch bin files to safetensor, intended to be used for LLM conversion.☆69Updated last year
- 4 bits quantization of LLaMa using GPTQ☆129Updated 2 years ago
- FuseAI Project☆87Updated 5 months ago
- ☆17Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆78Updated last year
- Our data munging code.☆34Updated 8 months ago
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Updated 8 months ago
- ☆76Updated last year
- A bagel, with everything.☆321Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆43Updated 2 years ago
- ☆157Updated 11 months ago
- ☆73Updated last year
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆58Updated 2 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆137Updated 11 months ago
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆83Updated last year