microsoft / DataScienceProblems
A repository containing the Jupyter notebook code generation benchmark.
☆58Updated 3 years ago
Alternatives and similar repositories for DataScienceProblems:
Users that are interested in DataScienceProblems are comparing it to the libraries listed below
- Official code release for the paper Coder Reviewer Reranking for Code Generation.☆42Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆46Updated last year
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆83Updated last year
- ☆74Updated last year
- Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"☆29Updated last year
- ☆45Updated last year
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆21Updated 3 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆39Updated 10 months ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated 2 years ago
- A plugin for code generation in PyCharm/IntelliJ using tranX☆35Updated 2 years ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- ☆29Updated last year
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- ☆31Updated last year
- ☆55Updated 2 years ago
- Embedding Recycling for Language models☆38Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆32Updated 2 months ago
- ☆30Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated last year
- Code for generating the JuICe dataset.☆36Updated 3 years ago
- A repository to perform self-instruct with a model on HF Hub☆32Updated last year
- ☆39Updated 2 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 2 years ago
- Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions"☆36Updated last year
- Few-shot Learning with Auxiliary Data☆26Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆42Updated 3 weeks ago
- ☆22Updated 3 months ago
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆72Updated 8 months ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆22Updated last year