[NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning
☆28Nov 19, 2024Updated last year
Alternatives and similar repositories for SemCoder
Users that are interested in SemCoder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Program analysis tools built on tree-sitter (https://github.com/tree-sitter/tree-sitter).☆70Nov 24, 2025Updated 6 months ago
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆10Nov 24, 2025Updated 6 months ago
- ☆22Mar 21, 2024Updated 2 years ago
- ☆24Nov 10, 2023Updated 2 years ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆169Oct 11, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Probing pre-trained source code models☆15Apr 27, 2022Updated 4 years ago
- ☆11Jul 20, 2021Updated 4 years ago
- CoditT5: Pretraining for Source Code and Natural Language Editing☆29Jan 16, 2025Updated last year
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Dec 21, 2021Updated 4 years ago
- ☆10May 14, 2024Updated 2 years ago
- Web archiving utility library☆11May 5, 2026Updated last month
- For our ICSE23 paper "KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair" by Nan Jiang, Thibaud Lutellier, Yiling…☆33Sep 28, 2023Updated 2 years ago
- Python library for code analysis with CPG and Joern☆25Jun 23, 2023Updated 2 years ago
- BDA: Practical Dependence Analysis for Binary Executables by Unbiased Whole-program Path Sampling and Per-path Abstract Interpretation☆31Feb 26, 2021Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- ☆11Dec 23, 2018Updated 7 years ago
- The official Implementation for TKDE paper "Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalizatio…☆14Aug 6, 2023Updated 2 years ago
- CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context☆19Feb 20, 2026Updated 3 months ago
- Artifact repository for the paper "Perfect Is the Enemy of Test Oracle", In Proceedings of The 30th ACM Joint European Software Engineeri…☆11May 4, 2023Updated 3 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- Utilities for constructing a large dataset of LLVM IR☆25Jun 2, 2025Updated last year
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Extracts static code features from opencl kernels to be used for machine learning.☆10Apr 30, 2021Updated 5 years ago
- This project aims at predicting correlated column pairs in data tables by analyzing column names via large language models.☆11Aug 21, 2023Updated 2 years ago
- Deadline countdowns for academic conferences relevant to the SSE chair.☆13Updated this week
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- Evaluating SZZ Implementations Through a Developer-informed Oracle (https://arxiv.org/abs/2102.03300)☆19Nov 3, 2025Updated 7 months ago
- Continuous integration testing dataset☆11Apr 18, 2018Updated 8 years ago
- [ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.☆13Jan 5, 2024Updated 2 years ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- Automated scalable crash bucketing☆15Oct 2, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆14May 26, 2021Updated 5 years ago
- A Symbolic Execution Engine for Dynamic Kernel Analysis☆34Jun 16, 2024Updated last year
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆16Oct 24, 2022Updated 3 years ago
- Microsoft Complex Tasks Dataset☆17Jun 12, 2023Updated 2 years ago
- Implementation of CCS'2022 paper "SymLM: Predicting Function Names in Stripped Binaries via Context-Sensitive Execution-Aware Code Embedd…☆62Jul 6, 2025Updated 11 months ago
- Coeditor: Leveraging Repo-level Diffs for Code Auto-editing☆31Feb 25, 2024Updated 2 years ago
- ☆15Feb 24, 2021Updated 5 years ago