THU-KEG/EvaluationPapers4ChatGPT

THU-KEG / EvaluationPapers4ChatGPT

Resource, Evaluation and Detection Papers for ChatGPT

☆456

Alternatives and similar repositories for EvaluationPapers4ChatGPT

Users who are interested in EvaluationPapers4ChatGPT are comparing it to the repositories listed below.

zjunlp / Prompt4ReasoningPapers
[ACL 2023] Reasoning with Language Model Prompting: A Survey
☆1,008 · May 21, 2025 · Updated last year
krystalan / chatgpt_as_nlg_evaluator
Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study
☆43 · Mar 8, 2023 · Updated 3 years ago
THU-KEG / ChatLog
⏳ ChatLog: Recording and Analysing ChatGPT Across Time
☆104 · May 30, 2024 · Updated 2 years ago
shizhediao / ChatGPTPapers
Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.
☆333 · Aug 10, 2023 · Updated 2 years ago
dqxiu / ICL_PaperList
Paper List for In-context Learning 🌷
☆876 · Oct 8, 2024 · Updated last year
Timothyxxx / Chain-of-ThoughtsPapers
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
☆2,105 · Oct 5, 2023 · Updated 2 years ago
SEU-COIN / LLMPapers
Papers & Works for large language models (OpenAI GPT-4, Meta Llama, etc.).
☆313 · Nov 15, 2025 · Updated 8 months ago
sunlab-osu / Understanding-CoT
☆88 · Jun 1, 2023 · Updated 3 years ago
neulab / knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…
☆288 · Oct 20, 2022 · Updated 3 years ago
WeOpenML / PandaLM
☆926 · May 22, 2024 · Updated 2 years ago
Timothyxxx / EnvInteractiveLMPapers
Paper collections of methods that use language to interact with environments, including interacting with the real world, simulated worlds, or the WWW…
☆128 · Jul 26, 2023 · Updated 3 years ago
thunlp / ToolLearningPapers
☆923 · Jul 24, 2024 · Updated 2 years ago
zcgzcgzcg1 / ACL2022_KnowledgeNLP_Tutorial
Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing
☆286 · Aug 8, 2022 · Updated 3 years ago
FranxYao / chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,776 · Aug 4, 2024 · Updated last year
jeffhj / LM-reasoning
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
☆572 · Nov 13, 2023 · Updated 2 years ago
RUCKBReasoning / GLM-Dialog
☆59 · Aug 1, 2023 · Updated 2 years ago
Spico197 / awesome-lm-evaluation
🩺 A collection of ChatGPT evaluation reports on various benchmarks.
☆50 · Mar 28, 2023 · Updated 3 years ago
SinclairCoder / Instruction-Tuning-Papers
Reading list of Instruction-tuning. A trend starts from Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
☆769 · Jul 20, 2023 · Updated 3 years ago
txsun1997 / LMaaS-Papers
Awesome papers on Language-Model-as-a-Service (LMaaS)
☆545 · May 14, 2024 · Updated 2 years ago
FranxYao / GPT-Bargaining
Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
☆207 · May 24, 2023 · Updated 3 years ago
thunlp / PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
☆4,324 · Jul 17, 2023 · Updated 3 years ago
Timothyxxx / RetrivalLMPapers
Paper collections of retrieval-based (augmented) language models.
☆233 · May 24, 2024 · Updated 2 years ago
HillZhang1999 / llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …
☆1,085 · Sep 27, 2025 · Updated 10 months ago
MLGroupJLU / LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
☆1,610 · Apr 17, 2026 · Updated 3 months ago
Hello-SimpleAI / chatgpt-comparison-detection
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
☆1,420 · Dec 1, 2023 · Updated 2 years ago
xu1998hz / InstructScore_SEScore3
First explanation metric (diagnostic report) for text generation evaluation
☆62 · Mar 3, 2025 · Updated last year
AetherCortex / Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,605 · Aug 30, 2023 · Updated 2 years ago
JasonForJoy / Model-Editing-Hurt
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆37 · May 26, 2025 · Updated last year
zjunlp / KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
☆1,242 · Jun 25, 2026 · Updated last month
princeton-nlp / ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆523 · Oct 9, 2024 · Updated last year
txsun1997 / MOSS
MOSS is a conversational language model like ChatGPT.
☆740 · Apr 20, 2023 · Updated 3 years ago
THU-KEG / KoLA
[ICLR 2024] The open-source repo of THU-KEG's KoLA benchmark.
☆57 · Sep 28, 2023 · Updated 2 years ago
THU-KEG / Awesome_MOOCs
This is a repo listing some must-read papers on *AI-driven MOOCs* or *Intelligent Education* published in recent years, mainly contribute…
☆18 · Jun 8, 2022 · Updated 4 years ago
Romainpkq / ChatGPT4MT
🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation
☆73 · Mar 25, 2024 · Updated 2 years ago
pfliu-nlp / NLPedia-Pretrain
☆402 · Oct 12, 2021 · Updated 4 years ago
i-Eval / FairEval
☆145 · Sep 10, 2023 · Updated 2 years ago
xlang-ai / xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
☆364 · Apr 29, 2024 · Updated 2 years ago
sail-sg / symbolic-instruction-tuning
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆65 · Apr 18, 2023 · Updated 3 years ago
GaryYufei / AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey
☆742 · Sep 11, 2023 · Updated 2 years ago