microsoft / DataScienceProblems
A repository containing the Jupyter notebook code generation benchmark.
☆57Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for DataScienceProblems
- Official code release for the paper Coder Reviewer Reranking for Code Generation.☆42Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆38Updated 7 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆44Updated 11 months ago
- ☆29Updated 9 months ago
- ☆44Updated last year
- Code for generating the JuICe dataset.☆37Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆68Updated last year
- ☆55Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆22Updated 2 years ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated 10 months ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆41Updated last year
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆24Updated last year
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated last year
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition☆22Updated last year
- Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"☆29Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆23Updated last month
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆84Updated 2 years ago
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆79Updated last year
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Updated 3 years ago
- ☆30Updated last year
- Code for GenAug: Data Augmentation for Finetuning Text Generators.☆26Updated 3 years ago
- Few-shot Learning with Auxiliary Data☆26Updated 11 months ago
- ☆25Updated 2 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆66Updated this week
- ☆39Updated 2 years ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆21Updated 3 years ago