[EMNLP 2025] Dataset and Code of "PersonaGym: Evaluating Persona Agents and LLMs"
☆37Aug 21, 2025Updated 6 months ago
Alternatives and similar repositories for PersonaGym
Users that are interested in PersonaGym are comparing it to the libraries listed below
Sorting:
- ☆50May 19, 2025Updated 9 months ago
- ☆111Nov 7, 2024Updated last year
- AbstainQA, ACL 2024☆29Feb 4, 2026Updated last month
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- QueryGPT-ADK is an open-source, multi-agent system for natural language to SQL query generation and explanation. It leverages LLMs and v…☆16Jul 23, 2025Updated 7 months ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Oct 21, 2020Updated 5 years ago
- Block facebook from knowing you're typing a comment or message☆10Jun 4, 2017Updated 8 years ago
- 智付通API串接☆20Apr 20, 2023Updated 2 years ago
- Replication code for "The Structure of Toxic Conversations on Twitter" (WWW'21)☆10May 25, 2021Updated 4 years ago
- ☆12Dec 11, 2025Updated 2 months ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Apr 11, 2024Updated last year
- Nano vLLM☆12Jun 26, 2025Updated 8 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- ☆11Apr 13, 2023Updated 2 years ago
- [MICCAI-2023]Visual-Attribute Prompt Learning for Progressive Mild Cognitive Impairment Prediction☆15Dec 12, 2023Updated 2 years ago
- ☆11Oct 16, 2023Updated 2 years ago
- ☆12Oct 20, 2020Updated 5 years ago
- Code for a model-based version of Constrained Policy Optimization☆11May 6, 2021Updated 4 years ago
- Reproducible research paper in the journal Archaeology in Oceania☆16Jan 18, 2012Updated 14 years ago
- Dataset for Conversation Semantic Role Labeling☆11Aug 26, 2021Updated 4 years ago
- In this repository, I place my solution for the exercises in multiple famous math textbooks, including Stochastic Differential Equation, …☆13Nov 13, 2023Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Download AudioSet for Vision-Audio-Text Pre-training☆13May 16, 2022Updated 3 years ago
- Fork of Bliss☆14Dec 13, 2025Updated 2 months ago
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆12Sep 30, 2021Updated 4 years ago
- Apache Paimon Python The Python implementation of Apache Paimon.☆18Jan 7, 2026Updated 2 months ago
- ☆13Jun 4, 2024Updated last year
- A PyTorch implementation of "Meta-Amortized Variational Inference and Learning" (https://arxiv.org/abs/1902.01950)☆14Mar 31, 2020Updated 5 years ago
- ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models☆16Sep 27, 2024Updated last year
- K-Means algorithm in the Poincare Disk Model☆15Nov 12, 2018Updated 7 years ago
- ☆15Aug 25, 2021Updated 4 years ago
- ☆14Oct 12, 2024Updated last year
- This is the code for Coupled-translation Fusion Network.☆11Dec 2, 2021Updated 4 years ago
- Try several decodings on a string or file.☆13Nov 23, 2025Updated 3 months ago
- A C++ package for streaming submodular function maximization with Python bindings☆12Jul 12, 2021Updated 4 years ago
- CamRest676 is an English data set, I translate it into Chinese for training nlu.☆12Dec 20, 2017Updated 8 years ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Feb 11, 2023Updated 3 years ago