LC1332 / Haruhi-2-Dev
Just for debugging
☆56Updated last year
Alternatives and similar repositories for Haruhi-2-Dev
Users that are interested in Haruhi-2-Dev are comparing it to the libraries listed below
- A plan to extend ChatHaruhi into a zero-shot role-playing model☆104Updated last year
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆126Updated 4 months ago
- ☆238Updated 5 months ago
- deep learning☆149Updated last week
- RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models☆496Updated 7 months ago
- ☆226Updated last year
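- chatglm-6b fine-tuning/LoRA/PPO/inference; the samples are automatically generated integer and decimal arithmetic problems (addition, subtraction, multiplication, division); runs on GPU or CPU☆164Updated last year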
- SUS-Chat: Instruction tuning done right☆48Updated last year
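- Uses langchain for task planning and builds conversation-scenario resources for each subtask. An MCTS task executor lets each subtask draw on the resources in its context and explore through self-reflection to find its own best answer to the problem. This approach depends on the model's alignment preferences; for each preference we designed an engineering framework that applies a sampling strategy over the rewards of the model's own candidate answers☆29Updated last week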
- Based on the Evol-character framework and OpenAI API, enabling fine-grained role-playing data generation 🎭🧩.☆31Updated last year
- Text deduplication☆71Updated 11 months ago
- Luotuo Brawl (骆驼大乱斗): Massive Game Content Generated by LLM☆19Updated last year
- [EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models☆464Updated 4 months ago
- We are the first fully commercially usable role-playing large model.☆40Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆136Updated 5 months ago
- A dataset template for guiding chat-models to self-cognition, including information about the model’s identity, capabilities, usage, limi…☆27Updated last year
- A tool for manually annotating and ranking response data in the RLHF stage of large models.☆250Updated last year
- Zero-training LLM parameter tuning☆31Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- llama inference for tencentpretrain☆98Updated last year
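- The kanchil (鼷鹿, mouse-deer) is the world's smallest even-toed ungulate; this open-source project aims to explore whether small models (under 6B) can also be aligned with human preferences.☆113Updated 2 years ago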
- Imitate OpenAI with Local Models☆88Updated 8 months ago
- Luotuo-QA (骆驼QA): a Chinese large language model for reading comprehension.☆74Updated last year
- chatglm_rlhf_finetuning☆29Updated last year
- ☆128Updated last year
- Light local website for displaying performances from different chat models.☆86Updated last year
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆441Updated 7 months ago
- Instruction-tuning tool for large language models (supports FlashAttention)☆172Updated last year
- Apply RLHF directly to ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF☆192Updated last year
- An open-source LLM based on an MoE structure.☆58Updated 10 months ago