NathanaelBeau / CodeInsight
The CodeInsight dataset is designed for code generation tasks, providing developers with expert-curated examples that bridge the gap between conceptual intent and functional code. Published @ACL24.
☆14Updated last year
Alternatives and similar repositories for CodeInsight
Users who are interested in CodeInsight are comparing it to the repositories listed below.
- This is the official implementation for the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases'☆14Updated 2 years ago
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆41Updated 11 months ago
- Source code for the paper “ReACC: A Retrieval-Augmented Code Completion Framework”☆65Updated 3 years ago
- ☆16Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆67Updated last year
- Baselines for all tasks from Long Code Arena benchmarks 🏟️☆39Updated 10 months ago
- Generate the WizardCoder Instruct from the CodeAlpaca☆21Updated 2 years ago
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆14Updated 5 months ago
- Data creation, training and eval scripts for the IRCoder paper☆20Updated last year
- Replication package for EMNLP 2022 paper - RACE: Retrieval-Augmented Commit Message Generation☆20Updated 3 years ago
- ☆13Updated 2 years ago
- ☆48Updated 3 years ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Updated last year
- ☆46Updated 3 months ago
- This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023).☆14Updated 2 years ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆119Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆151Updated last year
- ☆22Updated 2 years ago
- Adversarial Attack for Pre-trained Code Models☆10Updated 3 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆129Updated last year
- Repo for paper: Controllable Text Generation with Language Constraints☆20Updated 2 years ago
- ☆38Updated 2 years ago
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆10Updated 2 months ago
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆59Updated last year
- ☆33Updated 4 months ago
- [ACL 2023] Code for ContraCLM: Contrastive Learning For Causal Language Model☆35Updated 2 years ago
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆23Updated 2 years ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆36Updated 2 years ago
- Source Code for ACL-21 main conference paper "CoSQA: 20,000+ Web Queries for Code Search and Question Answering".☆46Updated 3 years ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Updated last year