THU-KEG / EvaluationPapers4ChatGPTView external linksLinks
Resource, Evaluation and Detection Papers for ChatGPT
☆456Mar 21, 2024Updated last year
Alternatives and similar repositories for EvaluationPapers4ChatGPT
Users that are interested in EvaluationPapers4ChatGPT are comparing it to the libraries listed below
Sorting:
- [ACL 2023] Reasoning with Language Model Prompting: A Survey☆996May 21, 2025Updated 8 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Mar 8, 2023Updated 2 years ago
- Paper List for In-context Learning 🌷☆875Oct 8, 2024Updated last year
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".☆2,100Oct 5, 2023Updated 2 years ago
- Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.☆325Aug 10, 2023Updated 2 years ago
- Papers & Works for large languange models (OpenAI GPT-4, Meta Llama, etc.).☆315Nov 15, 2025Updated 3 months ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆286Oct 20, 2022Updated 3 years ago
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.☆567Nov 13, 2023Updated 2 years ago
- ☆921May 22, 2024Updated last year
- ☆88Jun 1, 2023Updated 2 years ago
- ☆917Jul 24, 2024Updated last year
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…☆129Jul 26, 2023Updated 2 years ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting☆2,768Aug 4, 2024Updated last year
- Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).☆766Jul 20, 2023Updated 2 years ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆103May 30, 2024Updated last year
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing☆286Aug 8, 2022Updated 3 years ago
- 🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation☆73Mar 25, 2024Updated last year
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆207May 24, 2023Updated 2 years ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Mar 28, 2023Updated 2 years ago
- Paper collections of retrieval-based (augmented) language model.☆232May 24, 2024Updated last year
- Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …☆1,076Sep 27, 2025Updated 4 months ago
- Awesome papers on Language-Model-as-a-Service (LMaaS)☆547May 14, 2024Updated last year
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".☆1,594Jun 3, 2025Updated 8 months ago
- Must-read papers on prompt-based tuning for pre-trained language models.☆4,297Jul 17, 2023Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Mar 3, 2025Updated 11 months ago
- Open Academic Research on Improving LLaMA to SOTA LLM☆1,611Aug 30, 2023Updated 2 years ago
- ☆59Aug 1, 2023Updated 2 years ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆511Oct 9, 2024Updated last year
- Paper collection on building and evaluating language model agents via executable language grounding☆365Apr 29, 2024Updated last year
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,212Jul 12, 2025Updated 7 months ago
- Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥☆1,341Dec 1, 2023Updated 2 years ago
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"☆35May 26, 2024Updated last year
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆58Feb 28, 2025Updated 11 months ago
- An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extrac…☆134Jan 17, 2024Updated 2 years ago
- ☆290Dec 2, 2022Updated 3 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆82Apr 10, 2023Updated 2 years ago
- ☆772Jun 13, 2024Updated last year
- Aligning Large Language Models with Human: A Survey☆742Sep 11, 2023Updated 2 years ago
- paper list on reasoning in NLP☆195Apr 7, 2025Updated 10 months ago