Magnetic2014/RoleEval

A Bilingual Role Evaluation Benchmark for Large Language Models

☆43

Alternatives and similar repositories for RoleEval

Users who are interested in RoleEval are comparing it to the repositories listed below.

morecry / CharacterEval
☆301 · May 27, 2025 · Updated last year
X-PLUG / SocialBench
RoleInteract: Evaluating the Social Interaction of Role-Playing Agents
☆70 · Oct 12, 2024 · Updated last year
OFA-Sys / Ditto
A self-alignment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…
☆212 · May 28, 2024 · Updated 2 years ago
InteractiveNLP-Team / RoleLLM-public
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
☆528 · Oct 11, 2024 · Updated last year
Magnetic2014 / llm-alignment-survey
A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…
☆82 · Sep 28, 2023 · Updated 2 years ago
Joanna0123 / character_profiling
Code and Data for the paper "Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works".
☆22 · Jul 24, 2024 · Updated last year
ahnjaewoo / timechara
🧙🏻 Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing…
☆21 · Dec 20, 2024 · Updated last year
blendmaster / rigid-faces
As-rigid-as-possible face deformation
☆12 · Apr 18, 2014 · Updated 12 years ago
choosewhatulike / trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
☆641 · Oct 29, 2024 · Updated last year
CUHK-ARISE / LLMPersonality
Code and data for the paper: On the Reliability of Psychological Scales on Large Language Models
☆31 · Dec 15, 2025 · Updated 7 months ago
Bauhinia-AI / evol-character
Based on the Evol-character framework and OpenAI API, enabling fine-grained role-playing data generation 🎭🧩.
☆29 · Feb 1, 2024 · Updated 2 years ago
weiyifan1023 / Neeko
Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"
☆140 · Jul 23, 2025 · Updated last year
thu-coai / CharacterGLM-6B
[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
☆505 · Oct 2, 2025 · Updated 9 months ago
nuochenpku / Awesome-Role-Play-Papers
Awesome papers for role-playing with language models
☆229 · Nov 3, 2024 · Updated last year
PedroUria / NLP-Movie_Scripts
Trying to predict a movie's success based on the script (before filming)
☆50 · Feb 2, 2020 · Updated 6 years ago
njuzrs / dialogue_distillation
☆15 · Nov 3, 2022 · Updated 3 years ago
RUCAIBox / HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆592 · Feb 12, 2024 · Updated 2 years ago
rrkarim / unbounded-cache-lm
Unbounded cache model for online language modeling with open vocabulary
☆11 · Feb 15, 2019 · Updated 7 years ago
Neph0s / InCharacter
Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previo…
☆100 · May 27, 2025 · Updated last year
Minami-su / character_AI_open
Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.
☆134 · Jan 9, 2025 · Updated last year
THU-KEG / KoLA
[ICLR24] The open-source repo of THU-KEG's KoLA benchmark.
☆57 · Sep 28, 2023 · Updated 2 years ago
PlusLabNLP / Narrative-Discourse
☆16 · Nov 5, 2024 · Updated last year
likenneth / dialogue_action_token
Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
☆31 · Jun 27, 2024 · Updated 2 years ago
Freder-chen / ReasonGenRM
A simple implementation of ReasonGenRM.
☆19 · Apr 21, 2025 · Updated last year
tsosea2 / eMLM
This is the code for our ACL 2021 paper entitled eMLM: A New Pre-training Objective for Emotion Related Tasks
☆15 · Sep 7, 2022 · Updated 3 years ago
yzhang1918 / cikm2022rudi
Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"
☆12 · Aug 16, 2022 · Updated 3 years ago
CLUEbenchmark / SuperCLUE-Video
A native-Chinese, multi-level evaluation benchmark for text-to-video generation
☆18 · Jul 8, 2024 · Updated 2 years ago
X-PLUG / CValues
Research on evaluating and aligning the values of Chinese large language models
☆560 · Jul 20, 2023 · Updated 3 years ago
GanjinZero / RRHF
[NIPS2023] RRHF & Wombat
☆806 · Sep 22, 2023 · Updated 2 years ago
TheNormativityLab / talk-aint-cheap
☆17 · Sep 2, 2025 · Updated 10 months ago
namkoong-lab / PersonalLLM
☆18 · Oct 8, 2024 · Updated last year
gczr / WideAndDeep
A PyTorch implementation of Wide & Deep, validated on the Avazu dataset
☆10 · Feb 4, 2021 · Updated 5 years ago
chongyangtao / LLMs-for-NLG-Evaluation
Awesome LLM for NLG Evaluation Papers
☆26 · Jan 23, 2024 · Updated 2 years ago
MiuLab / PersonaLLM-Survey
☆116 · Oct 11, 2024 · Updated last year
OSU-NLP-Group / reversal-curse-binding
☆25 · Apr 3, 2025 · Updated last year
silverriver / PersonalDilaog
Scripts for constructing the PersonalDialog dataset (https://arxiv.org/abs/1901.09672)
☆44 · Sep 13, 2022 · Updated 3 years ago
QunBB / WBDC2021
☆20 · Sep 1, 2021 · Updated 4 years ago
caoyu-noob / D3
The implementation for ACL 2022 paper
☆20 · Aug 14, 2022 · Updated 3 years ago
shengc / tf-lstm-crf-tagger
TensorFlow Implementation For [Neural Architecture for Named Entity Recognition](https://arxiv.org/abs/1603.01360)
☆12 · Mar 4, 2018 · Updated 8 years ago