FSoft-AI4Code/DocChecker

DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment

☆15

Alternatives and similar repositories for DocChecker

Users who are interested in DocChecker are comparing it to the repositories listed below.

FSoft-AI4Code / CodeText-parser
⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file
☆52 · Jul 1, 2024 · Updated 2 years ago
andvg3 / LSDM
Dataset and Code for NeurIPS 2023 paper "Language-driven Scene Synthesis using Multi-conditional Diffusion Model."
☆48 · Aug 8, 2024 · Updated last year
FSoft-AI4Code / CodeMMLU
[ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.
☆29 · Apr 21, 2025 · Updated last year
FSoft-AI4Code / RepoExec
[NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…
☆45 · Jan 8, 2026 · Updated 6 months ago
FSoft-AI4Code / TheVault
[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
☆105 · Aug 21, 2024 · Updated last year
marcusm117 / IdentityChain
[ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
☆11 · Nov 24, 2025 · Updated 7 months ago
airalcorn2 / paved2paradise
Cost-effective and scalable LiDAR simulation by factoring the real world.
☆12 · Dec 8, 2025 · Updated 7 months ago
google-research / vet
☆17 · Jun 5, 2024 · Updated 2 years ago
VietHoang1512 / MT-SGD
Stochastic Multiple Target Sampling Gradient Descent (NeurIPS 2022)
☆13 · Sep 19, 2022 · Updated 3 years ago
LIANGQINGYUAN / awesome_codegeneration
A list of papers and resources dedicated to code generation
☆21 · Nov 2, 2022 · Updated 3 years ago
XuyangShen / Non-binary-deep-transfer-learning-for-image-classification
Supplementary Material for Non-binary Deep Transfer Learning for Image Classification
☆18 · Jul 22, 2021 · Updated 4 years ago
dashends / CodeSyntax
Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"
☆16 · Oct 24, 2022 · Updated 3 years ago
mruettgers / preact-template-esp
A simple, minimal "Hello World" template for Preact CLI for being used on an ESP8266/ESP32
☆12 · Dec 10, 2021 · Updated 4 years ago
LC1332 / Learn-Python-with-GPT
Teacher Li Lulu's Copilot-Python course. Co-evolve with large language models such as ChatGPT.
☆10 · Jun 3, 2025 · Updated last year
FSoft-AI4Code / XMainframe
Language Model for Mainframe Modernization
☆74 · Aug 23, 2024 · Updated last year
giganticode / probes
Probing pre-trained source code models
☆15 · Apr 27, 2022 · Updated 4 years ago
VietHoang1512 / khmer-nltk
Khmer natural language processing toolkit
☆84 · Mar 17, 2026 · Updated 4 months ago
tranquyenbk173 / BERT_ITE
Official implementation of "From Implicit to Explicit Feedback: A deep neural network for modeling sequential behaviours and long-short t…
☆19 · Oct 16, 2025 · Updated 9 months ago
bdqnghi / awesome-ai4code
A collection of recent papers, benchmarks and datasets of AI4Code domain.
☆61 · Apr 23, 2024 · Updated 2 years ago
tranquyenbk173 / FCRE-via-MMI
Preserving Generalization of Language Models in Few-shot Continual Relation Extraction (EMNLP2024)
☆17 · Nov 21, 2024 · Updated last year
abvijaykumar / python-lora-finetuning
Finetuning a codegen model with python instruction set using QLORA technique for better efficacy
☆11 · Aug 31, 2023 · Updated 2 years ago
YiQi0318 / LLMs_daily_arxiv
☆15 · Jul 25, 2024 · Updated last year
UKPLab / codeclarqa
Asking Clarification Questions for Code Generation in General-Purpose Programming Language
☆11 · May 26, 2023 · Updated 3 years ago
ARiSE-Lab / CONCORD_ISSTA_23
Open-source repository for the ISSTA'23 paper "CONCORD: Clone-aware Contrastive Learning for Source Code"
☆11 · Nov 10, 2023 · Updated 2 years ago
jin-guo / COMP585
☆12 · Aug 28, 2025 · Updated 10 months ago
FNakano / CFA
Physical Computing and Applications
☆25 · Updated this week
floatai / HumanEval-XL
[LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
☆42 · Mar 7, 2025 · Updated last year
khtee / text-classification-pytorch
PyTorch implementation of RNN, CNN, BiGRU and LSTM for text classification
☆10 · Apr 30, 2021 · Updated 5 years ago
taesiri / ZoomIsAllYouNeed
Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …
☆42 · Dec 13, 2023 · Updated 2 years ago
Naplesoul / LSM-KV
A key-value storage system based on LSM Tree, using Skip List and Bloom Filter to accelerate.
☆10 · Jun 10, 2021 · Updated 5 years ago
FSoft-AI4Code / HyperAgent
Generalist Software Agents to Solve Software Engineering Tasks
☆247 · Dec 10, 2024 · Updated last year
salmansherin / QExplore
QExplore is a dynamic automatic exploration tool for dynamic web applications. It reverse engineers a state-flow model that can be used t…
☆13 · Mar 6, 2025 · Updated last year
Fsoft-AIC / Grasp-Anything
Dataset and Code for ICRA 2024 paper "Grasp-Anything: Large-scale Grasp Dataset from Foundation Models."
☆229 · Jun 26, 2024 · Updated 2 years ago
jie-jw-wu / Survey-CodeLLM4LowResource-DSL
A Survey on LLM-based Code Generation for Low-Resource and Domain-Specific Programming Languages
☆20 · Jun 11, 2026 · Updated last month
jacquesboitreaud / vina_docking
Python scripts for molecular docking of molecules vs DUDE protein targets, using Vina
☆25 · Apr 20, 2020 · Updated 6 years ago
microsoft / WaveCoder
Advancing LLM with Diverse Coding Capabilities
☆80 · Jul 25, 2024 · Updated last year
awsm-research / ChatGPT4Vul
☆16 · Nov 24, 2023 · Updated 2 years ago
MolSSI-Education / getting-started-computational-chemistry
☆20 · Jan 23, 2021 · Updated 5 years ago
jamesmurdza / humaneval-langchain
Benchmark results from code generation with LLMs
☆17 · Sep 1, 2023 · Updated 2 years ago