BK-SCOSS / sctokenizerLinks
A Source Code Tokenizer
☆14Updated last year
Alternatives and similar repositories for sctokenizer
Users that are interested in sctokenizer are comparing it to the libraries listed below
Sorting:
- A Dataset of 600k Java Source Code Changes Categorized by Diff Size http://arxiv.org/pdf/2108.04631☆23Updated last year
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)☆59Updated 4 years ago
- Models and datasets for annotated code search.☆35Updated 2 years ago
- A collection of recent papers, benchmarks and datasets of AI4Code domain.☆58Updated last year
- A toolkit for pre-processing large source code corpora☆45Updated 3 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆55Updated 3 years ago
- Implementation of "Automatic Source Code Summarization with Extended Tree-LSTM"☆36Updated 3 years ago
- Code implementation for CoTexT: Multi-task Learning with Code-Text Transformer☆36Updated 4 years ago
- ☆23Updated 2 years ago
- Learning to Update Natural Language Comments Based on Code Changes: Artifact☆33Updated 5 years ago
- Reproduce the results of Tree-based Convolutional Neural Network (TBCNN)☆39Updated 2 years ago
- Semantic Code Search☆37Updated 2 years ago
- A curated list of software engineering research, data set, tool.☆33Updated 3 years ago
- an implementation of "code2vec: Learning Distributed Representations of Code"☆30Updated last year
- ☆24Updated 4 years ago
- ☆29Updated 3 years ago
- This repo is the benchmark for source code summarization on C language☆26Updated 4 years ago
- Code for "Deep Graph Matching and Searching for Semantic Code Retrieval"☆24Updated 4 years ago
- Set of tools to help working with "Big Code"☆42Updated 3 years ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆186Updated 3 years ago
- Implementation of the paper "Language-agnostic representation learning of source code from structure and context".☆172Updated 3 years ago
- Code for the paper "A Structural Model for Contextual Code Changes"☆32Updated 2 years ago
- Hoppity☆60Updated 5 years ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Updated 4 years ago
- Replication Code for "Self-Supervised Bug Detection and Repair" NeurIPS 2021☆112Updated 3 years ago
- ☆18Updated 3 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Updated 3 years ago
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning☆169Updated 4 years ago
- Deep Just-In-Time Inconsistency Detection Between Comments and Source Code: Artifact☆22Updated 6 months ago
- Source Code Data Augmentation for Deep Learning: A Survey.☆66Updated last year