sxswz213 / CRSEC
☆38Updated 7 months ago
Alternatives and similar repositories for CRSEC:
Users that are interested in CRSEC are comparing it to the libraries listed below
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆31Updated 6 months ago
- ☆56Updated 3 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆35Updated 11 months ago
- ☆24Updated this week
- A collection of resources that investigate social agents.☆139Updated 3 weeks ago
- ☆129Updated 3 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆97Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 4 months ago
- ☆19Updated 5 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆132Updated 5 months ago
- HiSim: A Hybrid Social Media Simulation Framework☆32Updated 9 months ago
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆83Updated 10 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆112Updated 6 months ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆93Updated 11 months ago
- website repo for agent-based social movement simulation☆19Updated 9 months ago
- ☆30Updated last year
- ☆41Updated 2 weeks ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆39Updated last year
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.☆42Updated 9 months ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆48Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆301Updated 8 months ago
- Yelp Simulator for WWW'25 AgentSociety Challenge☆74Updated last week
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆61Updated 11 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆57Updated 5 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆72Updated 7 months ago
- ☆54Updated 5 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆133Updated last month
- This is the code for the ICLR 2023 paper "Leveraging Large Language Models for Multiple Choice Question Answering."☆39Updated 2 years ago
- ☆19Updated last year
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆48Updated last year