marcusm117 / IdentityChain
[ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
☆8Updated 10 months ago
Alternatives and similar repositories for IdentityChain:
Users that are interested in IdentityChain are comparing it to the libraries listed below
- Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models☆29Updated last year
- ☆13Updated 4 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆30Updated 9 months ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆14Updated 2 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆52Updated last year
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Updated last year
- [NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning☆20Updated 4 months ago
- ☆20Updated 2 years ago
- Baselines for all tasks from Long Code Arena benchmarks 🏟️☆28Updated this week
- [ICLR 2021] "Generating Adversarial Computer Programs using Optimized Obfuscations" by Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shi…☆29Updated 3 years ago
- ☆42Updated last month
- ☆28Updated 4 months ago
- ☆33Updated last year
- ☆23Updated 5 months ago
- We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆22Updated 2 years ago
- Source Code Data Augmentation for Deep Learning: A Survey.☆64Updated 9 months ago
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆36Updated 3 weeks ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆29Updated 2 years ago
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"☆14Updated last week
- Training and Benchmarking LLMs for Code Preference.☆33Updated 4 months ago
- The CodeInsight dataset is designed for code generation tasks, providing developers with expert-curated examples that bridge the gap betw…☆13Updated 5 months ago
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"☆41Updated 2 months ago
- Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…☆20Updated last year
- Code and Results of the Paper: On the Resilience of Multi-Agent Systems with Malicious Agents☆19Updated 2 months ago
- ☆28Updated 2 years ago
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆21Updated last year
- This is the project repository of our ASE22 paper: Natural Test Generation for Precise Testing of Question Answering Software☆14Updated 2 years ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆89Updated 10 months ago
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆62Updated 2 years ago
- Adversarial Robustness for Code☆15Updated 4 years ago