HLTCHKUST / chatgpt-evaluationView external linksLinks
This respository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity"
☆81Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for chatgpt-evaluation
Users that are interested in chatgpt-evaluation are comparing it to the libraries listed below
Sorting:
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17May 24, 2022Updated 3 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- [ACL 2023] The code for our ACL'23 paper Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Pr…☆24Jun 1, 2024Updated last year
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Apr 15, 2022Updated 3 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated last year
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Feb 13, 2023Updated 3 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- Deep Learning with Multiple Objectives: 2021 edition☆10May 27, 2021Updated 4 years ago
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- ☆37Dec 26, 2025Updated last month
- ☆10Sep 27, 2021Updated 4 years ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"☆13Mar 26, 2024Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"☆13Feb 14, 2022Updated 4 years ago
- Non-local Modeling for Image Quality Assessment☆13Dec 20, 2023Updated 2 years ago
- ☆10Jun 16, 2021Updated 4 years ago
- Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.…☆15Sep 14, 2023Updated 2 years ago
- ☆13May 21, 2024Updated last year
- CAiRE in DialDoc21: Data Augmentation for Information-SeekingDialogue System☆11May 24, 2022Updated 3 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆82Apr 11, 2024Updated last year
- Versatile Generative Language Model☆25Oct 29, 2022Updated 3 years ago
- A collection of instruction data and scripts for machine translation.☆20Sep 23, 2023Updated 2 years ago
- Can ChatGPT really understand the opinions, sentiments, and emotions contained in the text? We provide a preliminary evaluation.☆54Sep 23, 2024Updated last year
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Jan 21, 2024Updated 2 years ago
- ☆31Apr 14, 2023Updated 2 years ago
- ☆15Oct 20, 2023Updated 2 years ago
- Towards Few-Shot Fact-Checking via Perplexity☆14Jun 11, 2021Updated 4 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Oct 14, 2024Updated last year
- Trials of pre-trained BERT models for the medical domain in Japanese.☆12Nov 21, 2020Updated 5 years ago
- ☆14Apr 2, 2023Updated 2 years ago
- Unified MultiWOZ evaluation scripts for the context-to-response task.☆59Oct 11, 2023Updated 2 years ago
- code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith☆14May 18, 2021Updated 4 years ago
- ☆14Aug 21, 2025Updated 5 months ago
- Experiments codes for SIGKDD '22 paper "User-Event Graph Embedding Learning for Context-Aware Recommendation"☆11Aug 14, 2022Updated 3 years ago
- SOTA work about out-of-distribution detection☆14Mar 5, 2021Updated 4 years ago
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆21Mar 1, 2023Updated 2 years ago
- ☆37Aug 20, 2024Updated last year
- 🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT☆192Apr 17, 2023Updated 2 years ago