mayank31398 / pseudo-code-instructions
Pseudo-code Instructions dataset
☆24Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for pseudo-code-instructions
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆43Updated 10 months ago
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆79Updated last year
- ☆33Updated 2 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆48Updated 8 months ago
- ☆44Updated 2 months ago
- code for "Natural Language to Code Translation with Execution"☆39Updated 2 years ago
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆86Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆41Updated 9 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆78Updated 2 months ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆75Updated 6 months ago
- Evaluate the Quality of Critique☆35Updated 5 months ago
- Code and data for the FACTOR paper☆38Updated 11 months ago
- About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…☆66Updated 8 months ago
- [EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning☆44Updated 3 weeks ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆51Updated 5 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆156Updated 6 months ago
- Supporting code for ReCEval paper☆26Updated last month
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆24Updated last year
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆56Updated 2 weeks ago
- This repository contains data, code and models for contextual noncompliance.☆18Updated 3 months ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated last year
- ☆18Updated last year
- ☆75Updated last year
- ☆63Updated 2 years ago
- ☆105Updated 3 months ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆61Updated last year
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆119Updated 3 weeks ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆28Updated 4 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆107Updated last year