SMAT-Lab / SnifferDog
Restoring Execution Environments of Jupyter Notebooks
☆20Updated last year
Related projects: ⓘ
- ☆13Updated 5 months ago
- ManyTypes4Py: A benchmark Python dataset for machine learning-based type inference☆18Updated 2 years ago
- ML models often mispredict, and it is hard to tell when and why. We present a data mining based approach to discover whether there is a c…☆18Updated 2 years ago
- an implementation of "code2vec: Learning Distributed Representations of Code"☆29Updated 2 months ago
- Type4Py: Deep Similarity Learning-Based Type Inference for Python☆61Updated last year
- Code for "Typilus: Neural Type Hints" PLDI 2020☆59Updated last year
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆88Updated 2 years ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Updated 4 years ago
- Flow graphs for Python☆25Updated 2 years ago
- A toolkit for pre-processing large source code corpora☆45Updated last year
- A Systematic Literature Review of Deep Learning in Software Engineering☆18Updated 3 weeks ago
- BugsInPy: Benchmarking Bugs in Python Projects☆77Updated 2 months ago
- Fork of the awesome function_parser library from Github's CodeSearchNet Challenge repo: https://github.com/github/CodeSearchNet/tree/mast…☆24Updated last year
- TeCo: an ML+Execution model for test completion☆26Updated 3 months ago
- ☆16Updated 2 months ago
- ☆15Updated 2 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆14Updated 9 months ago
- CoditT5: Pretraining for Source Code and Natural Language Editing☆29Updated last year
- ☆33Updated 2 years ago
- Data and Code for Reproducing "Global Relational Models of Source Code"☆82Updated 3 years ago
- Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In P…☆38Updated 3 months ago
- Evaluation of source authorship attribution tool☆21Updated 3 years ago
- Code for ICML 2021 paper: How could Neural Networks understand Programs?☆122Updated 3 years ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆53Updated last month
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆24Updated 2 months ago
- Source Code Data Augmentation for Deep Learning: A Survey.☆58Updated 3 months ago
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021☆15Updated 3 years ago
- ☆23Updated last year
- This repository contains the dataset of our ISSTA 2018 paper: An Empirical Study on TensorFlow Program Bugs.☆30Updated 4 years ago
- OOPSLA 2019 Artifact for AutoPandas. Website at https://rbavishi.github.io/autopandas☆30Updated last year