Source code embeddings for various programming languages
☆16Jul 11, 2018Updated 7 years ago
Alternatives and similar repositories for source2vec
Users that are interested in source2vec are comparing it to the libraries listed below
- Based on the CodeBert pre-trained model; tests on the target dataset either after fine-tuning or directly☆14Oct 19, 2021Updated 4 years ago
- This is the official implementation for the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases'☆14Oct 4, 2023Updated 2 years ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆18Dec 8, 2022Updated 3 years ago
- We introduce FixEval, a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆26Aug 31, 2022Updated 3 years ago
- ☆12Jan 17, 2026Updated last month
- Source Code for "A multi-modal transformer-based code summarization approach for smart contracts"☆27Mar 16, 2021Updated 4 years ago
- SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems☆10Apr 11, 2025Updated 10 months ago
- Study notes for "Getting Started with Machine Learning in Python 3: Classic Algorithms and Applications"☆11Nov 9, 2018Updated 7 years ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"☆12Oct 21, 2024Updated last year
- This repository contains 4000 vulnerable hardware designs. Currently this is in Jsonl format for directly using it for fine-tuning LLMs. …☆21Mar 25, 2025Updated 11 months ago
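- PyTorch loss functions, adapted from the 科学空间 (Scientific Spaces) articles "Mitigating class imbalance with the idea of mutual information" and "Generalizing 'softmax + cross-entropy' to multi-label classification"☆12Aug 22, 2021Updated 4 years ago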
- This repo is the artifact of FUEL☆13Dec 2, 2025Updated 2 months ago
- Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/☆11Aug 9, 2024Updated last year
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- Semantic Scaffolds for Pseudocode-to-Code Generation (accepted by ACL 2020)☆14Jun 7, 2021Updated 4 years ago
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆15Jan 23, 2024Updated 2 years ago
- A python library that supports all vector databases specifically for LLM apps and frameworks☆13May 3, 2023Updated 2 years ago
- This repo contains a demo of adversarial strings poisoning a vector database and forcing specific hallucinations in a RAG chatbot.☆10May 2, 2024Updated last year
- ☆11Jul 28, 2021Updated 4 years ago
- ☆13Sep 11, 2023Updated 2 years ago
- Compares two images using a Siamese network (machine learning) trained with a PyTorch implementation☆10Jul 27, 2021Updated 4 years ago
- ☆11May 24, 2020Updated 5 years ago
- CodeRepoQA dataset☆15Feb 19, 2025Updated last year
- Generating Sentences from Disentangled Syntactic and Semantic Spaces☆11Jun 24, 2019Updated 6 years ago
- A PyCharm plugin that automatically generates getter and setter functions for Python object variables.☆11Oct 29, 2022Updated 3 years ago
- ☆11Oct 16, 2020Updated 5 years ago
- ☆48Nov 19, 2025Updated 3 months ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆12Oct 12, 2024Updated last year
- Program Transformation Tool for Java Methods☆11Sep 16, 2022Updated 3 years ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Dec 9, 2021Updated 4 years ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆14May 10, 2025Updated 9 months ago
- A library for mining of path-based representations of code (and more)☆299Nov 7, 2025Updated 3 months ago
- Kafka Connect Vespa sink connector☆17Apr 17, 2025Updated 10 months ago
- training BART from scratch☆12Dec 31, 2021Updated 4 years ago
- Learning to Recommend Method Names with Global Context☆13Jan 17, 2022Updated 4 years ago
- PyTorch implementation of BERT for seq2seq tasks, using the UniLM approach.☆10Apr 1, 2020Updated 5 years ago
- ☆17Mar 22, 2024Updated last year