ChiYeungLaw / Awsome-Code-Intelligence

In this repository, you'll find a curated selection of recent research papers, articles, and implementations from leading experts in the field of Code Intelligence.

☆16

Alternatives and similar repositories for Awsome-Code-Intelligence:

Users who are interested in Awsome-Code-Intelligence are comparing it to the libraries listed below.

Ablustrund / APPS_Plus
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
☆62 Updated 5 months ago
ise-uiuc / xft
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
☆29 Updated 7 months ago
thunlp / DebugBench
The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".
☆62 Updated 7 months ago
YihongDong / CDD-TED4LLMs
☆13 Updated 2 months ago
reddy-lab-code-research / PPOCoder
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆108 Updated last year
DeepSoftwareAnalytics / RLCoder
Reinforcement Learning for Repository-Level Code Completion
☆22 Updated 6 months ago
SparksofAGI / MHPP
☆28 Updated 3 months ago
NingMiao / SelfCheck
Code for the paper "SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning"
☆48 Updated last year
Alex-HaochenLi / RACS
[EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'
☆26 Updated last year
zhuohaoyu / KIEval
[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models
☆36 Updated 7 months ago
reddy-lab-code-research / CodeAttack
Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models"
☆26 Updated last year
seketeam / EvoCodeBench
An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories
☆49 Updated 6 months ago
NL2Code / NL2Code.github.io
Large Language Models Meet NL2Code: A Survey
☆36 Updated 3 months ago
THUDM / NaturalCodeBench
NaturalCodeBench (Findings of ACL 2024)
☆62 Updated 4 months ago
amazon-science / cceval
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
☆130 Updated 6 months ago
zkx06111 / ALGO
☆33 Updated last year
ntunlp / xCodeEval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
☆77 Updated 5 months ago
KwanWaiChung / MT-Eval
Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"
☆35 Updated 4 months ago
FloatAI / humaneval-xl
[LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
☆33 Updated last month
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆103 Updated 4 months ago
oceaneLIU / GraphCoder
☆30 Updated 8 months ago
adf1178 / PT4Code
☆46 Updated 2 years ago
JohnnyPeng18 / APIBench
APIBench is a benchmark for evaluating the performance of API recommendation approaches released in the paper "Revisiting, Benchmarking a…
☆53 Updated last year
Princeton-SysML / kNNLM_privacy
Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888
☆35 Updated 8 months ago
nju-websoft / DraCo
Dataflow-guided retrieval augmentation for repository-level code completion, ACL 2024 (main)
☆21 Updated 8 months ago
Lordog / R-Judge
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)
☆65 Updated last week
princeton-nlp / LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
☆119 Updated 7 months ago
tengxiaoliu / XoT
[EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
☆26 Updated last year
PlusLabNLP / Active-IT
Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"
☆24 Updated last year