microsoft / DataScienceProblems
A repository containing the Jupyter notebook code generation benchmark.
☆58 Updated 2 years ago
Alternatives and similar repositories for DataScienceProblems:
Users who are interested in DataScienceProblems are comparing it to the repositories listed below.
- Official code release for the paper Coder Reviewer Reranking for Code Generation. ☆42 Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆38 Updated 8 months ago
- Code for generating the JuICe dataset. ☆37 Updated 3 years ago
- Code Generator ☆23 Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆45 Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆56 Updated 2 years ago
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- Few-shot Learning with Auxiliary Data ☆26 Updated last year
- Detect hallucinated tokens for conditional sequence generation. ☆64 Updated 2 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 Updated 3 years ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation. ☆53 Updated 5 months ago
- Web queries dataset for code search ☆31 Updated last year
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ☆24 Updated 2 years ago
- VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning ☆38 Updated 2 years ago
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23) ☆81 Updated last year
- The LM Contamination Index is a manually created database of contamination evidence for LMs. ☆75 Updated 9 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆57 Updated 9 months ago
- RATransformers 🐭 - Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware! ☆41 Updated 2 years ago
- Code for "Natural Language to Code Translation with Execution" ☆40 Updated 2 years ago
- Code, datasets and results of the ChatGPT evaluation presented in the paper "ChatGPT: Jack of all trades, master of none" ☆29 Updated last year
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf ☆22 Updated last year
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models" ☆52 Updated 9 months ago
- Ranking of fine-tuned HF models as base models. ☆35 Updated last year