lanwuwei / BERTOverflowLinks

A Pre-trained BERT on StackOverflow Corpus

☆47

Alternatives and similar repositories for BERTOverflow

Users that are interested in BERTOverflow are comparing it to the libraries listed below

Sorting:

jeniyat / StackOverflowNER
Source Code and Data for Software Domain NER
☆146Updated 2 years ago
panthap2 / LearningToUpdateNLComments
Learning to Update Natural Language Comments Based on Code Changes: Artifact
☆33Updated 4 years ago
neulab / external-knowledge-codegen
Code and data for ACL20 paper "Incorporating External Knowledge through Pre-training for Natural Language to Code Generation"
☆97Updated 2 years ago
LittleYUYU / StackOverflow-Question-Code-Dataset
StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy…
☆169Updated 3 years ago
tech-srl / c3po
Code for the paper "A Structural Model for Contextual Code Changes"
☆32Updated last year
SEntiMoji / SEntiMoji
data, code, pre-trained models and experiment results for "SEntiMoji: An Emoji-Powered Learning Approach for Sentiment Analysis in Softwa…
☆34Updated last year
LittleYUYU / CoaCor
Code for "CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning" (WWW 2019)
☆37Updated 5 years ago
microsoft / Search4Code
Web queries dataset for code search
☆32Updated 2 years ago
CoderPat / structured-neural-summarization
A repository with the code for the paper with the same title
☆74Updated 6 years ago
facebookresearch / Neural-Code-Search-Evaluation-Dataset
evaluation dataset consisting of natural language query and code snippet pairs
☆124Updated last year
nokia / codesearch
Models and datasets for annotated code search.
☆35Updated 2 years ago
Jun-jie-Huang / CoCLR
Source Code for ACL-21 main conference paper "CoSQA: 20,000+ Web Queries for Code Search and Question Answering".
☆45Updated 2 years ago
csebuetnlp / CoDesc
A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.
☆53Updated 3 years ago
rajasagashe / JuICe
Code for generating the JuICe dataset.
☆37Updated 3 years ago
sriniiyer / concode
Mapping Language to Code in a Programmatic Context
☆80Updated 4 years ago
allenai / allennlp-semparse
A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP
☆108Updated 3 years ago
wasiahmad / NeuralCodeSum
Official implementation of our work, A Transformer-based Approach for Source Code Summarization [ACL 2020].
☆193Updated 3 years ago
sf-wa-326 / phrase-bert-topic-model
☆87Updated 3 years ago
microsoft / msrc-dpu-learning-to-represent-edits
C# Data Extraction for "Learning to Represent Edits"
☆26Updated 6 years ago
jeniyat / WNUT_2020_NER
This repository will contain the data and codes for WNUT 2020 NER task
☆51Updated 2 years ago
yg211 / acl20-ref-free-eval
SUPERT: Unsupervised multi-document summarization evaluation & generation
☆94Updated 2 years ago
microsoft / iclr2019-learning-to-represent-edits
Code for the ICLR 2019 paper "Learning to Represent Edits"
☆12Updated 2 years ago
zichaow / QG-Net
code for QG-Net: A Data-Driven Question Generation Model for Educational Content
☆49Updated 5 years ago
sweetpeach / ReCode
☆28Updated 3 years ago
giganticode / codeprep
A toolkit for pre-processing large source code corpora
☆47Updated 2 years ago
harperco / MeasEval
SemEval-2021 Task 8: MeasEval data and other bits
☆48Updated 3 years ago
KaijuML / data-to-text-hierarchical
Code for A Hierarchical Model for Data-to-Text Generation (Rebuffel, Soulier, Scoutheeten, Gallinari; ECIR 2020)
☆81Updated last year
BASE-LAB-SJTU / CosBench
A dataset for natural language code search.
☆14Updated 5 years ago
amazonqa / amazonqa
Evidence-based QA system for community question answering.
☆107Updated 4 years ago
sunlab-osu / MISP
Model-based Interactive Semantic Parsing (MISP) framework
☆53Updated last year