Convert source code into numerical tokens
☆66Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for tokenizer
Users that are interested in tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Smelling smells using Deep Learning☆47Mar 2, 2021Updated 5 years ago
- Unit testing for SQL queries☆26Aug 16, 2024Updated last year
- C# Data Extraction for "Learning to Represent Edits"☆27Nov 3, 2018Updated 7 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 6 years ago
- ☆15May 27, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Improving Code Readability Classification using Convolutional Neural Networks☆10Apr 18, 2018Updated 8 years ago
- Calculate the score of a repository based on best engineering practices.☆119Sep 27, 2020Updated 5 years ago
- Machine learning models for MLonCode trained using the source{d} stack☆19Oct 30, 2019Updated 6 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- fastText pretrained models for semantic representations of source code in Java, Python, PHP, C, C++ and C#.☆17Nov 11, 2020Updated 5 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆84Mar 24, 2023Updated 3 years ago
- Maven plugin to create HTML report to show dependecies in DSM view.☆14Sep 26, 2023Updated 2 years ago
- FaCoY Code-to-Code Search Engine☆34Jan 18, 2019Updated 7 years ago
- ☆21Jun 3, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Dec 9, 2022Updated 3 years ago
- OCaml library to transform an Llvm control flow graph in an SMT formula.☆13Apr 20, 2018Updated 8 years ago
- Accmut is a framework for acclerating mutation testing, which is based on LLVM-IR.☆10Jan 25, 2018Updated 8 years ago
- Manipulate C-family ASTs with Clang☆70Oct 22, 2018Updated 7 years ago
- Babelfish Python client☆17Nov 6, 2019Updated 6 years ago
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,144Sep 20, 2023Updated 2 years ago
- Assessing Source Code Semantic Similarity with Unsupervised Learning☆39Feb 27, 2018Updated 8 years ago
- ☆24Jun 17, 2021Updated 5 years ago
- A dynamic method for detecting faults in incremental and parallel builds.☆18Jul 27, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Hexagon processor module for IDA Pro disassembler☆19Oct 11, 2022Updated 3 years ago
- Simple relational online analytical processing☆31Mar 9, 2026Updated 3 months ago
- Your library for dynamic language modeling☆69Oct 23, 2018Updated 7 years ago
- 2019 年开源年度报告☆11Jan 7, 2020Updated 6 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- An empirical study on patch correctness☆15Nov 5, 2022Updated 3 years ago
- Artifact repository for the paper "Perfect Is the Enemy of Test Oracle", In Proceedings of The 30th ACM Joint European Software Engineeri…☆11May 4, 2023Updated 3 years ago
- AST factorization: transformation AST of Kotlin source code to a vector☆11Oct 17, 2019Updated 6 years ago
- Greek translation of the IEEE Software Engineering Body of Knowledge☆23Apr 28, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- clone from myJIT(a fork of GNU lightning)☆11Mar 17, 2015Updated 11 years ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆348Nov 27, 2019Updated 6 years ago
- Creates a Lucene index out of files from a local folder☆13Aug 8, 2014Updated 11 years ago
- Source code for the Naturalize project☆57Sep 5, 2015Updated 10 years ago
- Privacy-preserving epidemic dosimeter based on DP-3T contact tracing☆53May 3, 2021Updated 5 years ago
- ☆14May 28, 2024Updated 2 years ago
- Build kaldi inside docker containers with option for CUDA support☆12Feb 6, 2017Updated 9 years ago