NUSTM / LLMs-Waver-In-Judgments
☆12Updated last year
Alternatives and similar repositories for LLMs-Waver-In-Judgments
Users interested in LLMs-Waver-In-Judgments are comparing it to the repositories listed below
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)☆21Updated this week
- Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe…☆14Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Updated last year
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆16Updated last year
- Code and Results of the Paper: On the Reliability of Psychological Scales on Large Language Models☆30Updated last year
- ACL'2023: Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning☆40Updated 3 years ago
- ☆16Updated 3 years ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆83Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Updated last year
- ☆38Updated last year
- ☆42Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Updated last year
- ☆27Updated 2 years ago
- EMNLP2022: Learning Robust Representations for Continual Relation Extraction via Adversarial Class Augmentation☆14Updated 3 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Updated 7 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Updated 2 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆11Updated 2 years ago
- ☆64Updated 2 years ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)☆30Updated 3 weeks ago
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆17Updated 2 years ago
- Code and data for "Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue" (ACM TOIS)☆12Updated 3 weeks ago
- Safety-J: Evaluating Safety with Critique☆16Updated last year
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Updated 2 years ago
- ☆32Updated last year
- my commonly-used tools☆63Updated 10 months ago
- BeHonest: Benchmarking Honesty in Large Language Models☆34Updated last year
- ☆25Updated 2 years ago
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks.☆50Updated 2 years ago
- self-adaptive in-context learning☆45Updated 2 years ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Updated 2 years ago