jasonyux / GDPZero
☆23Updated 2 years ago
Alternatives and similar repositories for GDPZero
Users that are interested in GDPZero are comparing it to the libraries listed below
- ☆31Updated 5 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆165Updated last year
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆36Updated last year
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models"☆20Updated last year
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.☆60Updated last year
- Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs☆13Updated last year
- ☆75Updated last year
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆139Updated last year
- ☆17Updated 11 months ago
- The awesome agents in the era of large language models☆69Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆56Updated 11 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆117Updated last year
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆88Updated 5 months ago
- Official implementation of our paper "Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration".☆13Updated 11 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆62Updated last year
- ☆51Updated 5 months ago
- Collection of papers for scalable automated alignment.☆94Updated last year
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆82Updated 9 months ago
- Code and Results of the Paper: On the Reliability of Psychological Scales on Large Language Models☆30Updated last year
- ☆31Updated 5 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024)☆49Updated last year
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆52Updated 5 months ago
- ☆25Updated 2 years ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆19Updated 2 months ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)☆30Updated last week
- Code for Research Project TLDR☆23Updated 3 months ago
- Safety-J: Evaluating Safety with Critique☆16Updated last year
- Koishi's Day 2025 Paper (NeurIPS 2025): "Codifying Character Logic in Role-Playing"☆11Updated 2 months ago
- ☆21Updated 9 months ago