[NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning
☆27Nov 19, 2024Updated last year
Alternatives and similar repositories for SemCoder
Users that are interested in SemCoder are comparing it to the libraries listed below
Sorting:
- Program analysis tools built on tree-sitter (https://github.com/tree-sitter/tree-sitter).☆62Nov 24, 2025Updated 3 months ago
- An integration of JoernTI's CodeTIDAL5 neural type inference model.☆27Jan 27, 2025Updated last year
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆10Nov 24, 2025Updated 3 months ago
- ☆22Mar 21, 2024Updated last year
- ☆16Mar 22, 2024Updated last year
- ☆23Nov 10, 2023Updated 2 years ago
- A collection of datasets for machine learning for big code☆62Oct 8, 2021Updated 4 years ago
- CoditT5: Pretraining for Source Code and Natural Language Editing☆28Jan 16, 2025Updated last year
- ☆11Jul 20, 2021Updated 4 years ago
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Dec 21, 2021Updated 4 years ago
- ☆11May 14, 2024Updated last year
- TeCo: an ML+Execution model for test completion☆31Jun 16, 2024Updated last year
- A toy implementation about Program Dependence Graph using LLVM☆13Sep 27, 2023Updated 2 years ago
- IST'21 & SANER'22: Semantic-Preserving Program Transformations☆31Oct 25, 2022Updated 3 years ago
- The official Implementation for TKDE paper "Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalizatio…☆14Aug 6, 2023Updated 2 years ago
- The C parser for GumTree☆14Sep 25, 2020Updated 5 years ago
- Release of the ConditionalQA dataset☆21Nov 2, 2021Updated 4 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- Utilities for constructing a large dataset of LLVM IR☆25Jun 2, 2025Updated 9 months ago
- ☆13Feb 14, 2022Updated 4 years ago
- Extract and combine multiple source code views using tree-sitter☆157Sep 17, 2025Updated 6 months ago
- Extracts static code features from opencl kernels to be used for machine learning.☆10Apr 30, 2021Updated 4 years ago
- Deadline countdowns for academic conferences relevant to the SSE chair.☆13Feb 10, 2026Updated last month
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- Evaluating SZZ Implementations Through a Developer-informed Oracle (https://arxiv.org/abs/2102.03300)☆19Nov 3, 2025Updated 4 months ago
- Demo about Eclipse IDE, Che, LSP4E, LSP4J and JDT-LS related to the Language Server Protocol☆12Dec 7, 2018Updated 7 years ago
- [AAAI 2022] Official implementation of the paper Rethinking the Two-Stage Framework for Grounded Situation Recognition, AAAI 2022.☆13Mar 19, 2022Updated 4 years ago
- Continuous integration testing dataset☆12Apr 18, 2018Updated 7 years ago
- [ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.☆13Jan 5, 2024Updated 2 years ago
- Enhacing Code Pre-trained Models by Contrastive Learning☆38Mar 8, 2023Updated 3 years ago
- The code for the Mimic and Rephrase paper☆13Mar 19, 2023Updated 3 years ago
- A Symbolic Execution Engine for Dynamic Kernel Analysis☆33Jun 16, 2024Updated last year
- ☆15Nov 28, 2023Updated 2 years ago
- Microsoft Complex Tasks Dataset☆17Jun 12, 2023Updated 2 years ago
- Coeditor: Leveraging Repo-level Diffs for Code Auto-editing☆32Feb 25, 2024Updated 2 years ago
- Implementation of CCS'2022 paper "SymLM: Predicting Function Names in Stripped Binaries via Context-Sensitive Execution-Aware Code Embedd…☆63Jul 6, 2025Updated 8 months ago
- ☆15Feb 24, 2021Updated 5 years ago
- ☆38Apr 1, 2024Updated last year
- Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.☆10Sep 6, 2022Updated 3 years ago