CUHK-ARISE / CodeCrashLinks
[NeurIPS 2025] CodeCrash: Exposing LLM Fragility to Misleading Natural Language in Code Reasoning
☆16Updated 2 weeks ago
Alternatives and similar repositories for CodeCrash
Users that are interested in CodeCrash are comparing it to the libraries listed below
Sorting:
- Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via Online Modification"☆27Updated last year
- Repo-Level Code generation papers☆232Updated last month
- FlexNF: Flexible Network Function Orchestration for Scalable On-Path Service Chain Serving, ToN 2023 && IWQoS 2021☆19Updated last year
- ☆25Updated 6 months ago
- The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".☆85Updated last year
- This repo is for our submission for ICSE 2025.☆20Updated last year
- [VLDB'2025] LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data☆19Updated 3 months ago
- [ICSE'25] Aligning the Objective of LLM-based Program Repair☆23Updated 10 months ago
- [TOSEM'25] The official GitHub page for the survey paper "A Survey on Large Language Models for Code Generation".☆183Updated 6 months ago
- A Systematic Literature Review on Large Language Models for Automated Program Repair☆228Updated 2 weeks ago
- Simultaneous evaluation on both functionality and security of LLM-generated code.☆31Updated 2 months ago
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"☆17Updated 10 months ago
- ☆12Updated 3 years ago
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆75Updated last year
- Adversarial Attack for Pre-trained Code Models☆10Updated 3 years ago
- ☆21Updated last year
- A comprehensive code domain benchmark review of LLM researches.☆194Updated 4 months ago
- ☆11Updated last year
- Neural Code Intelligence Survey 2024-25; Reading lists and resources☆280Updated 6 months ago
- This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…☆27Updated 3 years ago
- AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection [ASE'23]☆41Updated last year
- ☆51Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆67Updated last year
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories☆36Updated last year
- Benchmark ClassEval for class-level code generation.☆145Updated last year
- [ISSTA 2025] A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing☆13Updated 11 months ago
- Making code edting up to 7.7x faster using multi-layer speculation☆24Updated 11 months ago
- ☆61Updated 2 years ago
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025]☆55Updated last week
- This repository hosts the source code for the paper "ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Mo…☆16Updated last month