Convert source code into numerical tokens
☆66Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for tokenizer
Users that are interested in tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Smelling smells using Deep Learning☆47Mar 2, 2021Updated 5 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆13Apr 21, 2024Updated 2 years ago
- Unit testing for SQL queries☆26Aug 16, 2024Updated last year
- C# Data Extraction for "Learning to Represent Edits"☆27Nov 3, 2018Updated 7 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆20Nov 6, 2019Updated 6 years ago
- ☆14May 27, 2022Updated 3 years ago
- Calculate the score of a repository based on best engineering practices.☆117Sep 27, 2020Updated 5 years ago
- Machine learning models for MLonCode trained using the source{d} stack☆19Oct 30, 2019Updated 6 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- fastText pretrained models for semantic representations of source code in Java, Python, PHP, C, C++ and C#.☆17Nov 11, 2020Updated 5 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆84Mar 24, 2023Updated 3 years ago
- A Source Code Tokenizer☆13Oct 30, 2024Updated last year
- DeepCS: Deep Code Search☆284May 26, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Maven plugin to create HTML report to show dependecies in DSM view.☆14Sep 26, 2023Updated 2 years ago
- FaCoY Code-to-Code Search Engine☆34Jan 18, 2019Updated 7 years ago
- ☆21Jun 3, 2019Updated 6 years ago
- ☆17Dec 9, 2022Updated 3 years ago
- OCaml library to transform an Llvm control flow graph in an SMT formula.☆13Apr 20, 2018Updated 8 years ago
- Accmut is a framework for acclerating mutation testing, which is based on LLVM-IR.☆10Jan 25, 2018Updated 8 years ago
- Manipulate C-family ASTs with Clang☆70Oct 22, 2018Updated 7 years ago
- Babelfish Python client☆17Nov 6, 2019Updated 6 years ago
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,143Sep 20, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Assessing Source Code Semantic Similarity with Unsupervised Learning☆39Feb 27, 2018Updated 8 years ago
- ☆24Jun 17, 2021Updated 4 years ago
- Dependency Structure Matrix☆19Feb 6, 2012Updated 14 years ago
- Simple relational online analytical processing☆31Mar 9, 2026Updated 2 months ago
- Your library for dynamic language modeling☆67Oct 23, 2018Updated 7 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- An empirical study on patch correctness☆15Nov 5, 2022Updated 3 years ago
- Artifact repository for the paper "Perfect Is the Enemy of Test Oracle", In Proceedings of The 30th ACM Joint European Software Engineeri…☆11May 4, 2023Updated 3 years ago
- Fast time library☆20May 12, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Register-based VM as C library☆11Feb 13, 2016Updated 10 years ago
- clone from myJIT(a fork of GNU lightning)☆11Mar 17, 2015Updated 11 years ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆346Nov 27, 2019Updated 6 years ago
- Creates a Lucene index out of files from a local folder☆13Aug 8, 2014Updated 11 years ago
- Source code for the Naturalize project☆57Sep 5, 2015Updated 10 years ago
- ☆14May 28, 2024Updated last year
- We propose a novel DL-based mutation technique (LEAM), which adapts the syntax-guided encoder-decoder architecture to build two sub-model…☆29Jun 16, 2024Updated last year