Fast tokenization and structural analysis of any programming language
☆62Jan 14, 2025Updated last year
Alternatives and similar repositories for code_tokenize
Users that are interested in code_tokenize are comparing it to the libraries listed below
Sorting:
- Fast and robust AST parsing of any language☆67Nov 11, 2025Updated 4 months ago
- Fast AST based code differencing in Python☆44Jan 31, 2026Updated last month
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Apr 24, 2021Updated 4 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Oct 4, 2022Updated 3 years ago
- Generating Adversarial Examples for Holding Robustness of Source Code Processing Models☆15Dec 2, 2021Updated 4 years ago
- Code Generation as a Dual Task of Code Summarization.☆30Jun 28, 2021Updated 4 years ago
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆65Apr 18, 2022Updated 3 years ago
- ☆13Jul 6, 2023Updated 2 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆15Feb 24, 2022Updated 4 years ago
- Some writeups in ctf.☆11Mar 31, 2022Updated 3 years ago
- Set of tools to help working with "Big Code"☆42Apr 28, 2022Updated 3 years ago
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆17May 25, 2025Updated 9 months ago
- Program Translator AI built on Pytorch☆15Dec 19, 2019Updated 6 years ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Aug 19, 2020Updated 5 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆19Nov 29, 2023Updated 2 years ago
- ☆41Jan 13, 2023Updated 3 years ago
- Stuff related to scraping the Code Review StackExchange☆12Jan 19, 2023Updated 3 years ago
- ☆24Jun 17, 2021Updated 4 years ago
- This repo is the benchmark for source code summarization on C language☆26Mar 18, 2021Updated 5 years ago
- IST'21 & SANER'22: Semantic-Preserving Program Transformations☆31Oct 25, 2022Updated 3 years ago
- Artifact repository for the paper "Perfect Is the Enemy of Test Oracle", In Proceedings of The 30th ACM Joint European Software Engineeri…☆11May 4, 2023Updated 2 years ago
- Sample code for 3rd party developers working on Android On Snapdragon☆12Sep 4, 2024Updated last year
- Information about the CodedotAI reading group sessions.☆12Aug 16, 2021Updated 4 years ago
- Jigsaw Dataset: Natural language to Python Pandas code☆55Dec 18, 2022Updated 3 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Dec 13, 2024Updated last year
- ☆29Aug 25, 2023Updated 2 years ago
- ☆18Apr 15, 2024Updated last year
- ☆53Sep 11, 2021Updated 4 years ago
- Code Snippet Recommendation from Stack Overflow Post☆19Jun 30, 2021Updated 4 years ago
- [AAAI 2021] - TreeCaps: Tree-based Capsule Network for Source Code Processing☆23Mar 24, 2023Updated 2 years ago
- ICSE 2021 Artifact for: Shipwright: A Human-in-the-Loop System for Dockerfile Repair.☆23May 11, 2021Updated 4 years ago
- ☆23Aug 6, 2020Updated 5 years ago
- Replication package for EMNLP2022 paper- RACE: Retrieval-Augmented Commit Message Generation☆20Oct 21, 2022Updated 3 years ago
- Models and datasets for annotated code search.☆35May 22, 2023Updated 2 years ago
- BGraph is a tool designed to generate dependencies graphs from Android.bp soong files.☆20Sep 19, 2025Updated 6 months ago
- This project provides several implementations for commit untangling and proposes a new representation of git patches by projecting the pa…☆12Jul 28, 2025Updated 7 months ago
- ☆16Nov 29, 2019Updated 6 years ago
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆14Oct 4, 2023Updated 2 years ago
- code and data for paper "Automatic Generation and Summarization of Shellcode via Transformer and Dual Learning", which accepted in SANER …☆13May 8, 2022Updated 3 years ago