Neph0s / CoSERLinks
Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"
☆90Updated last month
Alternatives and similar repositories for CoSER
Users that are interested in CoSER are comparing it to the libraries listed below
Sorting:
- Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previo…☆76Updated last week
- RoleInteract: Evaluating the Social Interaction of Role-Playing Agents☆55Updated 7 months ago
- ☆151Updated this week
- Reformatted Alignment☆114Updated 8 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆61Updated this week
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆249Updated 5 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆68Updated 3 weeks ago
- ☆47Updated 5 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆69Updated 4 months ago
- ☆82Updated last year
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆179Updated 2 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆97Updated this week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆248Updated 3 weeks ago
- ☆150Updated last month
- repository for CharacterChat, a personalized social support system☆72Updated 10 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆73Updated last month
- ☆49Updated last year
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 8 months ago
- ☆240Updated last week
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆39Updated 10 months ago
- Awesome papers for role-playing with language models☆188Updated 7 months ago
- On Memorization of Large Language Models in Logical Reasoning☆65Updated 2 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆106Updated last month
- A Bilingual Role Evaluation Benchmark for Large Language Models☆40Updated last year
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆100Updated last week
- The official repository of the Omni-MATH benchmark.☆83Updated 5 months ago
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆127Updated 4 months ago
- [ACL 2025, Main Conference] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆28Updated 10 months ago
- ☆50Updated this week
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆51Updated last year