A Pre-trained BERT on StackOverflow Corpus
☆46Feb 27, 2021Updated 5 years ago
Alternatives and similar repositories for BERTOverflow
Users that are interested in BERTOverflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source Code and Data for Software Domain NER☆147Dec 21, 2022Updated 3 years ago
- ☆14Jan 13, 2023Updated 3 years ago
- A collection of publications that works on code models but beyond focusing on the accuracies.☆13Jun 30, 2023Updated 2 years ago
- HashtagMaster: Segmentation tool for hashtags☆12Oct 27, 2020Updated 5 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository will contain the data and codes for WNUT 2020 NER task☆52Dec 21, 2022Updated 3 years ago
- Characterizing the natural language descriptions in software logging statements [ASE'18]☆17Dec 5, 2018Updated 7 years ago
- ☆16Jun 20, 2017Updated 8 years ago
- TeLL: Log Level Suggestions via Modeling Multi-Level Code Block Information, ISSTA'22☆14Jul 14, 2022Updated 3 years ago
- Subword based Pairwise Word Interaction Model for Paraphrase Identification☆22Jun 11, 2018Updated 7 years ago
- Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding☆11May 23, 2024Updated last year
- ☆15Nov 5, 2020Updated 5 years ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆12Dec 9, 2021Updated 4 years ago
- BuGL - A Cross-Language Dataset for Bug Localization☆10Feb 8, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code Snippet Recommendation from Stack Overflow Post☆19Jun 30, 2021Updated 4 years ago
- FOCUS is a context-aware collaborative-filtering system that exploits cross relationships among OSS projects to suggest the inclusion of …☆21Jun 14, 2023Updated 2 years ago
- A dataset for natural language code search.☆14Feb 13, 2020Updated 6 years ago
- Small semi-manual annotated web news corpus in Swedish for CoreNLP NER. 4 categories, PER, ORG, LOC and MISC.☆12Jun 27, 2020Updated 5 years ago
- This is Research Artifact for DevGPT Dataset☆54Jul 24, 2024Updated last year
- A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac…☆24Jun 1, 2025Updated 11 months ago
- ☆27Apr 7, 2026Updated 3 weeks ago
- AVATAR: Fixing Semantic Bugs with Fix Patterns of Static Analysis Violations☆26Apr 26, 2021Updated 5 years ago
- ☆14Dec 18, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Your library for dynamic language modeling☆67Oct 23, 2018Updated 7 years ago
- Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020)☆33Nov 28, 2022Updated 3 years ago
- ☆33Jun 12, 2023Updated 2 years ago
- ☆12Oct 29, 2022Updated 3 years ago
- Code for Neural Coreference Resolution for Arabic☆12May 12, 2022Updated 3 years ago
- ACL 2023 (Findings) End-to-end Cross-lingual Label Project☆15Nov 24, 2023Updated 2 years ago
- Benchmarks for Kaggle's Predict Closed Questions on Stack Overflow competition☆55Mar 19, 2016Updated 10 years ago
- Paper: "Predicting Subjective Features from Questions on QA Websites using BERT"☆14May 22, 2022Updated 3 years ago
- A fictional desktop session of the Pokémon's Pr Chen. Imitating a Windows 11 like user interface, the project has been made using Vite, V…☆11Sep 24, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- CLCDSA: Cross Language Code Clone Detection using Syntactical Features and API Documentation☆22Jun 29, 2025Updated 10 months ago
- Implementation of End-to-End Query Term Weighting (TW-BERT)☆35Jun 29, 2025Updated 10 months ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- 📒Record some paper read notes☆20Jan 1, 2022Updated 4 years ago
- A dataset of reproducible breaking dependency updates, SANER 2024 (https://doi.org/10.1109/SANER60148.2024.00024)☆22Updated this week
- Annotated corpus and code for "Extracting COVID-19 Events from Twitter".☆44May 19, 2022Updated 3 years ago
- ☆10Feb 9, 2019Updated 7 years ago