taisazero / socratic-debugging-benchmarkLinks

The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice Debuggers to guide them towards discovering and fixing a buggy python program.

☆18

Alternatives and similar repositories for socratic-debugging-benchmark

Users that are interested in socratic-debugging-benchmark are comparing it to the libraries listed below

Sorting:

rosewang2008 / bridge
NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…
☆43Updated last year
reasoning-machines / prompt-lib
A set of utilities for running few-shot prompting experiments on large-language models
☆123Updated last year
minalee-research / coauthor-interface
☆99Updated last year
McGill-NLP / instruct-qa
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆86Updated last year
skywalker023 / fantom
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
☆56Updated last year
swarnaHub / ExplanationIntervention
[NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind
☆66Updated last year
mingdachen / SummScreen
SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)
☆38Updated 3 years ago
reasoning-machines / CoCoGen
Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)
☆85Updated 2 years ago
mukhal / GRACE
[EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning
☆49Updated last year
jlin816 / dialop
DialOp: Decision-oriented dialogue environments for collaborative language agents
☆110Updated 11 months ago
salesforce / factualNLG
Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"
☆59Updated 8 months ago
kernelmachine / silo-lm
SILO Language Models code repository
☆83Updated last year
allenai / CommaQA
Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents
☆24Updated 3 years ago
princeton-nlp / LM-Science-Tutor
☆46Updated last year
yuxiaw / OpenFactCheck
☆52Updated last year
allenai / marg-reviewer
Code/data for MARG (multi-agent review generation)
☆54Updated 2 weeks ago
dengyang17 / PACIFIC
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance
☆14Updated last year
google-research-datasets / Synthetic-Persona-Chat
The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat data…
☆102Updated last year
sotopia-lab / awesome-social-agents
A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.
☆96Updated last year
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆51Updated 2 months ago
oriyor / reasoning-on-cots
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆96Updated last year
debjitpaul / refiner
About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…
☆70Updated last year
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Updated last year
SALT-NLP / positive-frames
Data and code for the paper "Inducing Positive Perspectives with Text Reframing"
☆61Updated 2 years ago
rxlqn / awesome-llm-self-reflection
augmented LLM with self reflection
☆132Updated last year
facebookresearch / doc-storygen-v2
Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation
☆85Updated last year
allenai / persona-bias
☆26Updated last year
behavioral-data / Cognitive-Reframing
Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts
☆64Updated 2 years ago
SALT-NLP / demonstrated-feedback
☆128Updated last year
tatsu-lab / opinions_qa
☆116Updated last year