CUHK-ARISE/EmotionBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CUHK-ARISE/EmotionBench)

CUHK-ARISE / EmotionBench

Code and data for the paper: Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans

☆122

Alternatives and similar repositories for EmotionBench

Users that are interested in EmotionBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Jarviswang94 / Multilingual_safety_benchmark
View on GitHub
Multilingual safety benchmark for Large Language Models
☆53Sep 1, 2024Updated last year
Amaodemao / BiasPainter
View on GitHub
basically all the things I used for this article
☆24Jan 8, 2025Updated last year
CUHK-ARISE / GAMABench
View on GitHub
Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments
☆98Jan 26, 2026Updated 5 months ago
WebPAI / MRWeb
View on GitHub
☆34Mar 11, 2025Updated last year
yxwan123 / LogicAsker
View on GitHub
☆33Feb 19, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CUHK-ARISE / PsychoBench
View on GitHub
Code and data for the paper: On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs
☆135Jan 24, 2026Updated 6 months ago
yxwan123 / BiasAsker
View on GitHub
☆40Jan 9, 2025Updated last year
duyichao / MINETrans-IWSLT23
View on GitHub
Official implementation of our IWSLT 2023 paper "The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Tra…
☆16Jul 14, 2023Updated 3 years ago
CriticBench / CriticBench
View on GitHub
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31Mar 5, 2024Updated 2 years ago
penguinnnnn / awesome-llm-and-society
View on GitHub
Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.
☆51Nov 3, 2023Updated 2 years ago
xyliu-cs / RISE
View on GitHub
[NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)
☆33Aug 8, 2025Updated 11 months ago
WebPAI / ComUICoder
View on GitHub
[SIGKDD 2026] ComUICoder: Component-based Reusable UI Code Generation for Complex Websites via Semantic Segmentation and Element-wise Fee…
☆24Jun 2, 2026Updated last month
JohnnyPeng18 / HiTyper
View on GitHub
This is the tool released in ICSE 2022 paper "Static Inference Meets Deep Learning: A Hybrid Type Inference Approach for Python"
☆45Oct 19, 2023Updated 2 years ago
CUHK-Shenzhen-SE / D4C
View on GitHub
[ICSE'25] Aligning the Objective of LLM-based Program Repair
☆24Mar 8, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
WebPAI / EfficientUICoder
View on GitHub
[FSE 2026] EfficientUICoder: Efficient MLLM-based UI Code Generation via Input and Output Token Compression
☆26May 5, 2026Updated 2 months ago
logpai / AutoLog
View on GitHub
AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection [ASE'23]
☆41Feb 20, 2024Updated 2 years ago
kite99520 / DialSummEval
View on GitHub
Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"
☆14Jul 22, 2025Updated last year
Sahandfer / EmoBench
View on GitHub
[ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models
☆117May 16, 2025Updated last year
OpsPAI / aid
View on GitHub
Code for ASE'21 paper "AID: Efficient Prediction of Aggregated Intensity of Dependency in Large-scale Cloud Systems"
☆15Nov 2, 2021Updated 4 years ago
CUHK-Shenzhen-SE / RetromorphicTesting
View on GitHub
☆11Jan 19, 2025Updated last year
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago
choidami / inductive-oocr
View on GitHub
☆16Mar 22, 2025Updated last year
hexuandeng / Mono4SiMT
View on GitHub
The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉
☆12Jul 19, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Gringham / explainable-metrics-machine-translation
View on GitHub
explainable-machine-translation-metrics
☆12Jul 15, 2022Updated 4 years ago
zwhe99 / FeedbackMT
View on GitHub
Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"
☆22Jun 28, 2024Updated 2 years ago
WebPAI / DesignBench
View on GitHub
DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation
☆50May 25, 2026Updated last month
xyliu-cs / StateLM
View on GitHub
[ICLR'26] Official Open-source Implementation of StateLM
☆20Feb 13, 2026Updated 5 months ago
YintongHuo / SemParser
View on GitHub
☆19Oct 25, 2023Updated 2 years ago
zwhe99 / LLM-MT-Eval
View on GitHub
{DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}
☆14Jun 18, 2023Updated 3 years ago
zwhe99 / WMT22-En-Liv
View on GitHub
[WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian
☆23May 4, 2023Updated 3 years ago
gydpku / PPTC
View on GitHub
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
☆61Feb 29, 2024Updated 2 years ago
zwhe99 / RaSA
View on GitHub
[ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation
☆10May 19, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Vision-CAIR / affectiveVisDial
View on GitHub
☆13Jul 17, 2024Updated 2 years ago
CUHK-Shenzhen-SE / UTBoost
View on GitHub
[ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
☆36Aug 12, 2025Updated 11 months ago
Skytliang / SpyGame
View on GitHub
SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D
☆15Nov 9, 2023Updated 2 years ago
Atrewin / PGen
View on GitHub
Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL …
☆11May 22, 2023Updated 3 years ago
RobustNLP / TestNER
View on GitHub
A toolkit for testing and improving named entity recognition [ESEC/FSE'23]
☆11Aug 31, 2023Updated 2 years ago
WENGSYX / ControlLM
View on GitHub
ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…
☆21Nov 6, 2024Updated last year
kennymckormick / ARAS-Dataset
View on GitHub
☆11Nov 5, 2024Updated last year