vsamuel2003 / PersonaGymView external linksLinks
[EMNLP 2025] Dataset and Code of "PersonaGym: Evaluating Persona Agents and LLMs"
☆38Aug 21, 2025Updated 5 months ago
Alternatives and similar repositories for PersonaGym
Users that are interested in PersonaGym are comparing it to the libraries listed below
Sorting:
- ☆50May 19, 2025Updated 8 months ago
- ☆108Nov 7, 2024Updated last year
- QueryGPT-ADK is an open-source, multi-agent system for natural language to SQL query generation and explanation. It leverages LLMs and v…☆16Jul 23, 2025Updated 6 months ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- ☆10Sep 17, 2022Updated 3 years ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Oct 21, 2020Updated 5 years ago
- A non-distributed reference implementation of Facebook's read-optimized graph data store, TAO☆11May 25, 2020Updated 5 years ago
- Replication code for "The Structure of Toxic Conversations on Twitter" (WWW'21)☆10May 25, 2021Updated 4 years ago
- ☆12Dec 11, 2025Updated 2 months ago
- Follow Me: Conversation Planning for Target-driven Recommendation Dialogue Systems☆11Aug 1, 2023Updated 2 years ago
- Nano vLLM☆12Jun 26, 2025Updated 7 months ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Apr 11, 2024Updated last year
- ☆10Sep 9, 2024Updated last year
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ☆12Oct 20, 2020Updated 5 years ago
- Adaptation of babyagi in the Open AEA framework☆15May 11, 2023Updated 2 years ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- ☆11Jan 10, 2024Updated 2 years ago
- Fork of Bliss☆14Dec 13, 2025Updated 2 months ago
- ☆13Jun 4, 2024Updated last year
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆12Sep 30, 2021Updated 4 years ago
- Slackbot for Quivr 🧠☆15Jan 5, 2026Updated last month
- [ACL 2024] Dataset and Code of "ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction…☆16Jun 10, 2024Updated last year
- K-Means algorithm in the Poincare Disk Model☆15Nov 12, 2018Updated 7 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- Entity-Aware Dual Co-Attention Network for Fake News Detection, EACL 2023 Findings☆10Jun 11, 2023Updated 2 years ago
- ☆13Oct 12, 2024Updated last year
- The project on Conversational Aspect Sentiment Analysis (CASA)☆13Oct 8, 2022Updated 3 years ago
- Download AudioSet for Vision-Audio-Text Pre-training☆13May 16, 2022Updated 3 years ago
- ☆11May 4, 2022Updated 3 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆12May 10, 2021Updated 4 years ago
- A C++ package for streaming submodular function maximization with Python bindings☆12Jul 12, 2021Updated 4 years ago
- This is the code for Coupled-translation Fusion Network.☆11Dec 2, 2021Updated 4 years ago
- Code for Salesforce Research paper, CASPI: Causal-aware Safe Policy Improvement for Task-oriented dialogue - https://arxiv.org/abs/2103.0…☆14Jul 24, 2023Updated 2 years ago
- Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR.☆24Mar 27, 2025Updated 10 months ago
- ☆16Jun 21, 2017Updated 8 years ago
- Primus zktls sdk☆32Jan 28, 2026Updated 2 weeks ago
- ☆14Feb 9, 2023Updated 3 years ago