HanNight / soft_self_consistency
Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"
☆14Updated last week
Related projects: ⓘ
- Self-Explore to avoid ️the p️️it! Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards☆39Updated 4 months ago
- [ACL 2024 NLP4ConvAI Oral] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system m…☆33Updated 3 months ago
- ☆46Updated 2 weeks ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆62Updated 3 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆51Updated 5 months ago
- This the implementation of LeCo☆16Updated 2 months ago
- Code for Suri: Multi-constraint instruction following for long-form text generation☆15Updated last week
- Code for Findings of EMNLP2023 paper "Coarse-to-Fine Dual Encoders are Better Frame Identification Learners"☆12Updated 11 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆14Updated 2 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆29Updated 2 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆48Updated 4 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Updated 11 months ago
- ☆31Updated 3 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆84Updated 5 months ago
- [EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning☆41Updated 4 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆63Updated last week
- ☆13Updated 10 months ago
- Repo for Paper "Dual-Space Knowledge Distillation for Large Language Models".☆29Updated 2 weeks ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆28Updated 8 months ago
- BeHonest: Benchmarking Honesty in Large Language Models☆27Updated last month
- EMNLP 2023 Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts☆23Updated 10 months ago
- This is the official repository for the paper "EmoBench: Evaluating the Emotional Intelligence of Large Language Models"☆39Updated 6 months ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆23Updated 3 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆24Updated last month
- ☆28Updated 4 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers"☆30Updated last month
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts☆13Updated 2 weeks ago
- GPT as Human☆17Updated 8 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement☆21Updated last month
- ☆19Updated 10 months ago