[ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
☆10Nov 24, 2025Updated 5 months ago
Alternatives and similar repositories for IdentityChain
Users who are interested in IdentityChain are comparing it to the libraries listed below.
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆16Oct 24, 2022Updated 3 years ago
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆16Jan 23, 2024Updated 2 years ago
- This is the project repository of our ASE22 paper: Natural Test Generation for Precise Testing of Question Answering Software☆14Dec 1, 2022Updated 3 years ago
- ☆23Nov 10, 2023Updated 2 years ago
- [NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning☆28Nov 19, 2024Updated last year
- Generating Adversarial Examples for Holding Robustness of Source Code Processing Models☆16Dec 2, 2021Updated 4 years ago
- This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr…☆26Jun 27, 2022Updated 3 years ago
- ☆11May 14, 2024Updated last year
- Probing pre-trained source code models☆15Apr 27, 2022Updated 4 years ago
- ☆15Mar 27, 2026Updated last month
- ☆12Jul 8, 2023Updated 2 years ago
- Learning from Li Lulu's Copilot-Python. Co-evolving with ChatGPT and other large language models.☆10Jun 3, 2025Updated 10 months ago
- Grounding Language Models for Compositional and Spatial Reasoning☆18Oct 26, 2022Updated 3 years ago
- ☆16Jun 4, 2023Updated 2 years ago
- ☆10Nov 6, 2024Updated last year
- EvoEval: Evolving Coding Benchmarks via LLM☆81Apr 6, 2024Updated 2 years ago
- Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering☆13Aug 22, 2023Updated 2 years ago
- CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context☆19Feb 20, 2026Updated 2 months ago
- ☆26Jul 19, 2022Updated 3 years ago
- ☆34Jan 15, 2026Updated 3 months ago
- ☆12Jul 30, 2025Updated 9 months ago
- ☆33Sep 14, 2025Updated 7 months ago
- An Algorithm to Quantify Robustness of Recurrent Neural Networks☆49Apr 24, 2020Updated 6 years ago
- ☆13Oct 11, 2024Updated last year
- Code of On L-p Robustness of Decision Stumps and Trees, ICML 2020☆10Aug 3, 2020Updated 5 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- Open-source repository for the ISSTA'23 paper "CONCORD: Clone-aware Contrastive Learning for Source Code"☆11Nov 10, 2023Updated 2 years ago
- µTune: Auto-Tuned Threading for OLDI Microservices☆14Oct 8, 2018Updated 7 years ago
- ☆12Dec 9, 2020Updated 5 years ago
- Extracts static code features from opencl kernels to be used for machine learning.☆10Apr 30, 2021Updated 5 years ago
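- PyTorch loss functions, adapted from the Scientific Spaces (科学空间) articles "Mitigating class imbalance via mutual information" and "Generalizing 'softmax + cross-entropy' to multi-label classification"☆12Aug 22, 2021Updated 4 years ago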
- IST'21 & SANER'22: Semantic-Preserving Program Transformations☆31Oct 25, 2022Updated 3 years ago
- Enhancing Code Pre-trained Models by Contrastive Learning☆39Mar 8, 2023Updated 3 years ago
- ☆41Jan 13, 2023Updated 3 years ago
- Deadline countdowns for academic conferences relevant to the SSE chair.☆13Feb 10, 2026Updated 2 months ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆41Mar 7, 2025Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆49Dec 22, 2023Updated 2 years ago
- ☆54Jul 16, 2024Updated last year