A redistributable subset of the ETH Py150 corpus [https://www.sri.inf.ethz.ch/py150], introduced in the ICML 2020 paper 'Learning and Evaluating Contextual Embedding of Source Code' [https://proceedings.icml.cc/static/paper_files/icml/2020/5401-Paper.pdf].
☆32Aug 11, 2020Updated 5 years ago
Alternatives and similar repositories for eth_py150_open
Users that are interested in eth_py150_open are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Aug 19, 2020Updated 5 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆13Apr 21, 2024Updated 2 years ago
- code2vec for Python 3 made for NL2ML project☆18Mar 25, 2023Updated 3 years ago
- Code Generation as a Dual Task of Code Summarization.☆30Jun 28, 2021Updated 4 years ago
- Utilities used by the Deep Program Understanding team☆104Jun 12, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16Jun 18, 2024Updated last year
- Mining tool and large-scale datasets of single statement bug fixes in Python☆19Nov 29, 2023Updated 2 years ago
- Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Sourc…☆66Dec 3, 2021Updated 4 years ago
- A collection of datasets for machine learning for big code☆65Oct 8, 2021Updated 4 years ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆90Apr 5, 2022Updated 4 years ago
- ☆14May 31, 2021Updated 4 years ago
- Extracting Concise Bug-Fixing Patches from Human-Written Patches in Version Control Systems☆16Feb 21, 2023Updated 3 years ago
- Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Oct 22, 2018Updated 7 years ago
- Static heap reachability analysis for Java bytecode and Android memory leak finder.☆33Oct 25, 2014Updated 11 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- numcodecs Rust API for buffer compression☆12Updated this week
- I/O utilities and datasets for algebraic-graphs☆14Aug 29, 2022Updated 3 years ago
- sliding fast fourier transform using haskell streaming☆13Feb 19, 2019Updated 7 years ago
- ☆16Nov 12, 2025Updated 5 months ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 5 years ago
- Haskell implementation of Glumpy☆12Jun 21, 2021Updated 4 years ago
- Invadium runs exploit playbooks against vulnerable target applications in an intuitive, reproducible, and well-defined manner.☆11Apr 27, 2023Updated 3 years ago
- Code for "Generative Code Modeling with Graphs" (ICLR'19)☆172Dec 8, 2022Updated 3 years ago
- Haskell numerical ODE solvers☆14Aug 21, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code to reproduce the experiments in the paper Open Vocabulary Learning on Source Code with a Graph-Structured Cache☆21Apr 15, 2019Updated 7 years ago
- Data and Code for Reproducing "Global Relational Models of Source Code"☆85May 10, 2021Updated 4 years ago
- Fine-grained lattice primitives for Haskell☆18Mar 8, 2018Updated 8 years ago
- Honeyquest is a cyber security game that asks humans to distinguish neutral, risky, and deceptive payloads. Honeyquest presents participa…☆14Jan 8, 2026Updated 3 months ago
- HaVSA (Have-Saa) is a Haskell implementation of the Version Space Algebra Machine Learning technique described by Tessa Lau.☆12Jul 8, 2017Updated 8 years ago
- A Haskell implementation of distributed hash tables with two-phase commit.☆10Dec 9, 2016Updated 9 years ago
- LibSSH2 FFI bindings for Haskell☆26Apr 3, 2025Updated last year
- The Elements of Statistical Learning in Haskell☆13Nov 29, 2017Updated 8 years ago
- A collection of scripts to parse Indian Budget documents into clean machine readable formats.☆15Dec 1, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of our work, A Transformer-based Approach for Source Code Summarization [ACL 2020].☆193May 28, 2022Updated 3 years ago
- "Fail Fast" process management for Haskell; inspired by Erlang☆16Jan 19, 2017Updated 9 years ago
- This repo is the benchmark for source code summarization on C language☆26Mar 18, 2021Updated 5 years ago