microsoft / DataScienceProblems
A repository containing the Jupyter notebook code generation benchmark.
☆61 · Updated 3 years ago
Alternatives and similar repositories for DataScienceProblems
Users who are interested in DataScienceProblems are comparing it to the repositories listed below.
- Official code release for the paper Coder Reviewer Reranking for Code Generation. ☆45 · Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆48 · Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆42 · Updated 4 months ago
- Code for generating the JuICe dataset. ☆37 · Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆58 · Updated 2 years ago
- ☆53 · Updated last year
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ☆27 · Updated 2 years ago
- ☆39 · Updated 2 years ago
- ☆119 · Updated 11 months ago
- Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none" ☆29 · Updated 2 years ago
- ☆48 · Updated last year
- ☆77 · Updated 3 months ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022) ☆73 · Updated 2 years ago
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23) ☆88 · Updated 2 years ago
- Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension". ☆40 · Updated last year
- Code for "Natural Language to Code Translation with Execution" ☆41 · Updated 2 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models" ☆52 · Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆58 · Updated last year
- Repository for Decomposed Prompting ☆91 · Updated last year
- Web queries dataset for code search ☆32 · Updated 2 years ago
- ☆38 · Updated 2 years ago
- ☆34 · Updated 2 years ago
- ☆29 · Updated last year
- Code Generator ☆23 · Updated 2 years ago
- ☆26 · Updated 2 weeks ago
- In-context Example Selection with Influences ☆15 · Updated 2 years ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback" ☆20 · Updated 3 weeks ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 · Updated 3 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆27 · Updated 2 years ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ☆246 · Updated 8 months ago