YihongDong / CDD-TED4LLMs
☆12Updated last month
Related projects ⓘ
Alternatives and complementary repositories for CDD-TED4LLMs
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆48Updated 7 months ago
- ☆40Updated 2 months ago
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆59Updated 2 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆25Updated last year
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆42Updated 2 months ago
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆46Updated 2 months ago
- Generate the WizardCoder Instruct from the CodeAlpaca☆20Updated last year
- ☆25Updated this week
- Code and data for AAAI 2022 paper "Multilingual Code Snippets Training for Program Translation"☆10Updated 2 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Updated last year
- ☆43Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆75Updated 7 months ago
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆22Updated 11 months ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆53Updated 3 months ago
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆24Updated 11 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆13Updated last year
- ☆10Updated last year
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)☆38Updated 4 months ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Updated last year
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆22Updated last year
- NaturalCodeBench (Findings of ACL 2024)☆56Updated 3 weeks ago
- ☆25Updated last month
- VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning☆38Updated last year
- This is the official implement for the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases''☆13Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆27Updated 4 months ago
- ☆82Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Updated 2 years ago
- Benchmarking LLMs' Emotional Alignment with Humans☆63Updated last month
- This repo is the benchmark for source code summarization on C language☆24Updated 3 years ago
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13Updated 2 years ago