[NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning
☆28Nov 19, 2024Updated last year
Alternatives and similar repositories for SemCoder
Users who are interested in SemCoder are comparing it to the repositories listed below.
- Program analysis tools built on tree-sitter (https://github.com/tree-sitter/tree-sitter).☆64Nov 24, 2025Updated 5 months ago
- An integration of JoernTI's CodeTIDAL5 neural type inference model.☆29Jan 27, 2025Updated last year
- ☆22Mar 21, 2024Updated 2 years ago
- ☆23Nov 10, 2023Updated 2 years ago
- A collection of datasets for machine learning for big code☆65Oct 8, 2021Updated 4 years ago
- Probing pre-trained source code models☆15Apr 27, 2022Updated 4 years ago
- CoditT5: Pretraining for Source Code and Natural Language Editing☆28Jan 16, 2025Updated last year
- ☆11Jul 20, 2021Updated 4 years ago
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Dec 21, 2021Updated 4 years ago
- ☆22Nov 17, 2021Updated 4 years ago
- Web archiving utility library☆11Mar 11, 2026Updated last month
- For our ICSE23 paper "KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair" by Nan Jiang, Thibaud Lutellier, Yiling…☆33Sep 28, 2023Updated 2 years ago
- Python library for code analysis with CPG and Joern☆25Jun 23, 2023Updated 2 years ago
- BDA: Practical Dependence Analysis for Binary Executables by Unbiased Whole-program Path Sampling and Per-path Abstract Interpretation☆31Feb 26, 2021Updated 5 years ago
- ☆11Dec 23, 2018Updated 7 years ago
- IST'21 & SANER'22: Semantic-Preserving Program Transformations☆31Oct 25, 2022Updated 3 years ago
- The official Implementation for TKDE paper "Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalizatio…☆14Aug 6, 2023Updated 2 years ago
- CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context☆19Feb 20, 2026Updated 2 months ago
- The C parser for GumTree☆14Sep 25, 2020Updated 5 years ago
- Release of the ConditionalQA dataset☆21Nov 2, 2021Updated 4 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- Utilities for constructing a large dataset of LLVM IR☆25Jun 2, 2025Updated 10 months ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- This project aims at predicting correlated column pairs in data tables by analyzing column names via large language models.☆11Aug 21, 2023Updated 2 years ago
- Deadline countdowns for academic conferences relevant to the SSE chair.☆13Feb 10, 2026Updated 2 months ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- Evaluating SZZ Implementations Through a Developer-informed Oracle (https://arxiv.org/abs/2102.03300)☆19Nov 3, 2025Updated 5 months ago
- [ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.☆13Jan 5, 2024Updated 2 years ago
- [AAAI 2022] Official implementation of the paper Rethinking the Two-Stage Framework for Grounded Situation Recognition, AAAI 2022.☆13Mar 19, 2022Updated 4 years ago
- Automated scalable crash bucketing☆15Oct 2, 2018Updated 7 years ago
- Continuous integration testing dataset☆11Apr 18, 2018Updated 8 years ago
- Enhancing Code Pre-trained Models by Contrastive Learning☆39Mar 8, 2023Updated 3 years ago
- ☆14May 26, 2021Updated 4 years ago
- ☆15Nov 28, 2023Updated 2 years ago
- Microsoft Complex Tasks Dataset☆17Jun 12, 2023Updated 2 years ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆16Oct 24, 2022Updated 3 years ago
- Implementation of CCS'2022 paper "SymLM: Predicting Function Names in Stripped Binaries via Context-Sensitive Execution-Aware Code Embedd…☆62Jul 6, 2025Updated 9 months ago
- ☆15Feb 24, 2021Updated 5 years ago
- ☆38Apr 1, 2024Updated 2 years ago