microsoft / DataScienceProblemsLinks
A repository containing the Jupyter notebook code generation benchmark.
☆59Updated 3 years ago
Alternatives and similar repositories for DataScienceProblems
Users that are interested in DataScienceProblems are comparing it to the libraries listed below
Sorting:
- Official code release for the paper Coder Reviewer Reranking for Code Generation.☆43Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆48Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆42Updated 3 months ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆76Updated 11 months ago
- Embedding Recycling for Language models☆38Updated last year
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆55Updated 2 years ago
- ☆39Updated 2 years ago
- In-context Example Selection with Influences☆15Updated 2 years ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆72Updated 2 years ago
- ☆48Updated last year
- ☆75Updated 2 months ago
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems☆46Updated 2 years ago
- Code for generating the JuICe dataset.☆37Updated 3 years ago
- ☆53Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- ☆117Updated 10 months ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- ☆34Updated 2 years ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆21Updated 3 years ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆14Updated 2 years ago
- ☆29Updated last year
- ☆43Updated 3 months ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆57Updated 2 years ago
- Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension".☆41Updated last year
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Updated 4 years ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆22Updated last year
- ☆39Updated 2 years ago