DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment
☆15Jan 23, 2024Updated 2 years ago
Alternatives and similar repositories for DocChecker
Users that are interested in DocChecker are comparing it to the libraries listed below
- ⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file☆51Jul 1, 2024Updated last year
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆43Jan 8, 2026Updated last month
- Dataset and Code for NeurIPS 2023 paper "Language-driven Scene Synthesis using Multi-conditional Diffusion Model."☆48Aug 8, 2024Updated last year
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆29Apr 21, 2025Updated 10 months ago
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆10Nov 24, 2025Updated 3 months ago
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation☆103Aug 21, 2024Updated last year
- A list of papers and resources dedicated to code generation☆20Nov 2, 2022Updated 3 years ago
- This repository demonstrates an application built with Java 21 + SpringBoot 3 + MyBatis, including CRUD operations, authentication, rout…☆12Dec 1, 2024Updated last year
- 🚀 Vibe Stack - Docker setup for AI-powered coding with Vibe-Kanban + Claude Code | Secure secrets, browser-based VS Code, ready to deplo…☆42Feb 10, 2026Updated 2 weeks ago
- Advancing LLM with Diverse Coding Capabilities☆80Jul 25, 2024Updated last year
- SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems☆10Apr 11, 2025Updated 10 months ago
- This repo is the artifact of FUEL☆13Dec 2, 2025Updated 2 months ago
- Study notes for "Python 3 Introduction to Machine Learning: Classic Algorithms and Applications"☆11Nov 9, 2018Updated 7 years ago
- An abstractive summarization implementation (for English-language datasets) with Transformer and Pointer-generator☆12Dec 31, 2020Updated 5 years ago
- Learning Copilot-Python with teacher Li Lulu (李鲁鲁). Co-evolving with ChatGPT and other large language models.☆10Jun 3, 2025Updated 8 months ago
- Dify knowledge base retrieval tool☆13Apr 3, 2025Updated 10 months ago
- This repository contains 4000 vulnerable hardware designs. Currently this is in Jsonl format for directly using it for fine-tuning LLMs. …☆20Mar 25, 2025Updated 11 months ago
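- PyTorch loss functions, adapted from the 科学空间 (Scientific Spaces) articles "Mitigating class imbalance through the idea of mutual information" and "Generalizing 'softmax + cross-entropy' to multi-label classification"☆12Aug 22, 2021Updated 4 years ago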
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"☆12Oct 21, 2024Updated last year
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆41Mar 7, 2025Updated 11 months ago
- Finetuning a codegen model on a Python instruction set using the QLoRA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- Cost-effective and scalable LiDAR simulation by factoring the real world.☆12Dec 8, 2025Updated 2 months ago
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- Open-source repository for the ISSTA'23 paper "CONCORD: Clone-aware Contrastive Learning for Source Code"☆11Nov 10, 2023Updated 2 years ago
- Chatbot implemented in PyTorch with a seq2seq model☆10Feb 9, 2020Updated 6 years ago
- PyTorch implementation of RNN, CNN, BiGRU and LSTM for text classification☆10Apr 30, 2021Updated 4 years ago
- Official implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆20Oct 29, 2025Updated 3 months ago
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- Deliver LLMs of GGUF format via Dockerfile.☆14Oct 24, 2024Updated last year
- Semantic Scaffolds for Pseudocode-to-Code Generation (accepted by ACL 2020)☆14Jun 7, 2021Updated 4 years ago
- Collection of all language dataset for finetuning LLM☆10Apr 3, 2023Updated 2 years ago
- CodeBERT based mutation testing tool.☆13Nov 10, 2025Updated 3 months ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Dec 9, 2021Updated 4 years ago