taisazero / socratic-debugging-benchmark
This repository contains the code and dataset for Socratic Debugging, a novel task in which novice debuggers are questioned Socratically to guide them toward discovering and fixing the bugs in a Python program.
☆18 · Updated last year
Alternatives and similar repositories for socratic-debugging-benchmark
Users who are interested in socratic-debugging-benchmark are comparing it to the repositories listed below.
- NAACL 2024. Code & Dataset for "Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake… ☆45 · Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 · Updated last year
- ☆100 · Updated last year
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts ☆67 · Updated 2 years ago
- Inspecting and Editing Knowledge Representations in Language Models ☆119 · Updated 2 years ago
- A Computational Framework for Behavioral Assessment of LLM Therapists ☆38 · Updated last year
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023 ☆74 · Updated 4 months ago
- ☆29 · Updated 3 years ago
- The Prism Alignment Project ☆89 · Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆165 · Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆102 · Updated 2 years ago
- ☆47 · Updated 4 months ago
- ☆117 · Updated last year
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners" ☆109 · Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆136 · Updated last year
- Resources for cultural NLP research ☆113 · Updated 4 months ago
- Official repository for the AnnoMI dataset: the first public collection of expert-annotated MI transcripts. ☆81 · Updated 2 years ago
- ☆52 · Updated last year
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models" ☆247 · Updated last year
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing" ☆61 · Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆61 · Updated last year
- Code for our EMNLP 2023 paper - "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode… ☆54 · Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM. ☆53 · Updated last year
- ☆82 · Updated last year
- [NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting" ☆113 · Updated 2 years ago
- ☆58 · Updated last year
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023) ☆64 · Updated 2 years ago
- Token-level Reference-free Hallucination Detection ☆98 · Updated 2 years ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆156 · Updated 2 years ago
- RARR: Researching and Revising What Language Models Say, Using Language Models ☆51 · Updated 2 years ago