microsoft / JigsawDataset
Jigsaw Dataset: Natural language to Python Pandas code
☆53Updated 2 years ago
Alternatives and similar repositories for JigsawDataset:
Users that are interested in JigsawDataset are comparing it to the libraries listed below
- ☆74Updated last year
- Official code release for the paper Coder Reviewer Reranking for Code Generation.☆42Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆45Updated last year
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆52Updated 2 years ago
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆81Updated last year
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆70Updated 7 months ago
- Code Generator☆23Updated last year
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆122Updated 3 months ago
- ☆59Updated 8 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆28Updated last month
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆181Updated 10 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆40Updated 2 weeks ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆21Updated 3 years ago
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆47Updated 3 months ago
- Can Language Models Replace Programmers? RepoCod Says ‘Not Yet’ - by Shanchao Liang and Yiran Hu and Nan Jiang and Lin Tan☆15Updated 2 weeks ago
- ☆22Updated 2 months ago
- Semantic Code Search☆34Updated last year
- Graph-based method for end-to-end code completion with context awareness on repository☆56Updated 4 months ago
- Training language models to make programs faster☆85Updated 9 months ago
- Pretrained Language Models for Source code☆251Updated 3 years ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆87Updated 2 years ago
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation☆86Updated 5 months ago
- A basic and simple tool for code auto completion☆59Updated 6 months ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆70Updated last year
- SILO Language Models code repository☆81Updated 11 months ago
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆77Updated 4 months ago
- ☆29Updated last year
- ☆45Updated 2 months ago
- ☆56Updated last week