Value4AI / gpv

[AAAI 2025] Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models.

☆35

Alternatives and similar repositories for gpv:

Users that are interested in gpv are comparing it to the libraries listed below

siyuyuan / evoagent
Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"
☆93Updated 6 months ago
microsoft / competeai
[ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.
☆71Updated 9 months ago
David-Li0406 / Preference-Leakage
☆45Updated last month
Geaming2002 / Ruler
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆35Updated 6 months ago
zjunlp / KnowSelf
Agentic Knowledgeable Self-awareness
☆50Updated last week
MingLiiii / Layer_Gradient
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆63Updated last month
MTU-Bench-Team / MTU-Bench
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
☆41Updated 2 months ago
sail-sg / FlowReasoner
☆63Updated this week
git-disl / Virus
This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"
☆46Updated 2 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆71Updated 7 months ago
Yu-Fangxu / FoR
Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples
☆84Updated last month
NineAbyss / S2R
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆62Updated this week
THU-KEG / Agentic-Reward-Modeling
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆86Updated last month
dinobby / Symbolic-MoE
The code implementation of Symbolic-MoE
☆27Updated last month
NuoJohnChen / JudgeLRM
☆23Updated last week
tigerchen52 / awesome_role_of_small_models
a curated list of the role of small models in the LLM era
☆100Updated 7 months ago
yanweiyue / AgentPrune
☆53Updated last month
shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆17Updated 6 months ago
XinyuanLu00 / TART
This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"
☆52Updated 6 months ago
jiangjiechen / auction-arena
Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…
☆44Updated last year
zjunlp / WorfBench
[ICLR 2025] Benchmarking Agentic Workflow Generation
☆79Updated 2 months ago
uclaml / COPS
The official implementation of Cross-Task Experience Sharing (COPS)
☆22Updated 6 months ago
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆112Updated 11 months ago
xiaowu0162 / LongMemEval
Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)
☆70Updated 2 months ago
menhguin / minp_paper
Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper
☆32Updated last month
bingreeky / MaAS
Code of paper: Multi-agent Architecture Search via Agentic Supernet
☆45Updated last month
Quehry / HelloBench
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆40Updated 5 months ago
chengyou-jia / AgentStore
☆36Updated 4 months ago
WeiminXiong / MPO
MPO: Boosting LLM Agents with Meta Plan Optimization
☆50Updated last month
shuhao02 / RouterDC
The code of RouterDC
☆58Updated last week