Convert source code into numerical tokens
☆66Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for tokenizer
Users that are interested in tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Smelling smells using Deep Learning☆47Mar 2, 2021Updated 5 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆13Apr 21, 2024Updated last year
- C# Data Extraction for "Learning to Represent Edits"☆27Nov 3, 2018Updated 7 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 5 years ago
- ☆20Nov 6, 2019Updated 6 years ago
- ☆14May 27, 2022Updated 3 years ago
- Improving Code Readability Classification using Convolutional Neural Networks☆10Apr 18, 2018Updated 7 years ago
- Machine learning models for MLonCode trained using the source{d} stack☆19Oct 30, 2019Updated 6 years ago
- ☆18Apr 15, 2024Updated last year
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆84Mar 24, 2023Updated 3 years ago
- A Source Code Tokenizer☆14Oct 30, 2024Updated last year
- DeepCS: Deep Code Search☆283May 26, 2022Updated 3 years ago
- Maven plugin to create HTML report to show dependecies in DSM view.☆14Sep 26, 2023Updated 2 years ago
- FaCoY Code-to-Code Search Engine☆34Jan 18, 2019Updated 7 years ago
- ☆22Jun 3, 2019Updated 6 years ago
- ☆17Dec 9, 2022Updated 3 years ago
- OCaml library to transform an Llvm control flow graph in an SMT formula.☆13Apr 20, 2018Updated 7 years ago
- Accmut is a framework for acclerating mutation testing, which is based on LLVM-IR.☆10Jan 25, 2018Updated 8 years ago
- Manipulate C-family ASTs with Clang☆68Oct 22, 2018Updated 7 years ago
- Babelfish Python client☆17Nov 6, 2019Updated 6 years ago
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,143Sep 20, 2023Updated 2 years ago
- Assessing Source Code Semantic Similarity with Unsupervised Learning☆40Feb 27, 2018Updated 8 years ago
- ☆24Jun 17, 2021Updated 4 years ago
- Dependency Structure Matrix☆19Feb 6, 2012Updated 14 years ago
- ProgQuery is a system to extract useful syntactic and semantic information from source code programs and store it in a graph database for…☆17Jan 22, 2025Updated last year
- A customized version of Ella used in the paper `An Empirical Study of Android Test Generation Tools in Industrial Cases`.☆10Aug 19, 2020Updated 5 years ago
- A Python tool used for parsing exam questionnaires☆16May 7, 2025Updated 10 months ago
- Hexagon processor module for IDA Pro disassembler☆19Oct 11, 2022Updated 3 years ago
- A benchmark suite for performance-oriented shell-optimization research☆30Nov 6, 2025Updated 4 months ago
- Simple relational online analytical processing☆31Mar 9, 2026Updated 2 weeks ago
- moneymoney extension for payback accounts☆13Nov 5, 2017Updated 8 years ago
- Your library for dynamic language modeling☆67Oct 23, 2018Updated 7 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- An empirical study on patch correctness☆15Nov 5, 2022Updated 3 years ago
- Artifact repository for the paper "Perfect Is the Enemy of Test Oracle", In Proceedings of The 30th ACM Joint European Software Engineeri…☆11May 4, 2023Updated 2 years ago
- AST factorization: transformation AST of Kotlin source code to a vector☆11Oct 17, 2019Updated 6 years ago
- Privacy-preserving epidemic dosimeter based on DP-3T contact tracing☆53May 3, 2021Updated 4 years ago
- Source code for the Naturalize project☆56Sep 5, 2015Updated 10 years ago
- Command line and webapp application for driving Sonos boxes☆28Feb 1, 2015Updated 11 years ago