UKPLab / emnlp2024-code-prompting

Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024

☆25

Alternatives and similar repositories for emnlp2024-code-prompting

Users who are interested in emnlp2024-code-prompting are comparing it to the repositories listed below.

psunlpgroup / ReaLMistake
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
☆30 Updated 10 months ago
ShiZhengyan / PowerfulPromptFT
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…
☆74 Updated last year
xxxiaol / QRData
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
☆41 Updated 4 months ago
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆28 Updated last year
kaistAI / InstructIR
InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32 Updated last year
cognitiveailab / BYTESIZED32
Byte-sized text games for code generation tasks on virtual environments
☆19 Updated last year
allenai / dream
☆24 Updated 10 months ago
JeremyAlain / imitation_learning_from_language_feedback
This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"
☆27 Updated 2 years ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27 Updated 5 months ago
petezh / OpenD5
Tasks for describing differences between text distributions.
☆16 Updated 11 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆43 Updated last year
tml-epfl / icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆30 Updated 5 months ago
cambridgeltl / PairS
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)
☆47 Updated 5 months ago
Leezekun / MacRAG
☆16 Updated 2 weeks ago
HKUNLP / ProGen
[EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.
☆27 Updated 2 years ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences
☆71 Updated last year
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30 Updated 2 weeks ago
nlp-uoregon / ullme
☆20 Updated 3 months ago
technion-cs-nlp / hallucination-mitigation
☆22 Updated 6 months ago
archiki / ReCEval
Supporting code for ReCEval paper
☆29 Updated 10 months ago
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆45 Updated last year
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆26 Updated 7 months ago
allenai / CommaQA
Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents
☆24 Updated 3 years ago
allenai / super-benchmark
☆45 Updated 3 months ago
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29 Updated 5 months ago
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 Updated last year
benpry / chain-of-thought-metaphor
This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…
☆14 Updated 2 years ago
kumar-shridhar / Screws
SCREWS: A Modular Framework for Reasoning with Revisions
☆27 Updated last year
allenai / marg-reviewer
Code/data for MARG (multi-agent review generation)
☆44 Updated 8 months ago
EleutherAI / semantic-memorization
☆44 Updated 7 months ago