rsain / GitHub-CrawlerLinks
A Python script to collect data from GitHub using its API.
☆43Updated 2 years ago
Alternatives and similar repositories for GitHub-Crawler
Users that are interested in GitHub-Crawler are comparing it to the libraries listed below
Sorting:
- A toolkit for pre-processing large source code corpora☆47Updated 2 years ago
- This repository contains an implementation for design patterns detection. In this task, feature engineering and ensemble learning are app…☆10Updated 2 years ago
- ☆28Updated 2 years ago
- Code for the paper "A Structural Model for Contextual Code Changes"☆32Updated last year
- ☆49Updated 2 years ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆12Updated last month
- ☆24Updated 3 years ago
- ManyTypes4Py: A benchmark Python dataset for machine learning-based type inference☆23Updated 3 years ago
- Deep Just-In-Time Inconsistency Detection Between Comments and Source Code: Artifact☆22Updated 2 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆53Updated last year
- Learning to Update Natural Language Comments Based on Code Changes: Artifact☆33Updated 4 years ago
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆15Updated last year
- CoditT5: Pretraining for Source Code and Natural Language Editing☆28Updated 5 months ago
- ☆20Updated 2 years ago
- A curated list of software engineering research, data set, tool.☆32Updated 2 years ago
- ☆66Updated 3 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15Updated 3 years ago
- Implementation of "Automatic Source Code Summarization with Extended Tree-LSTM"☆36Updated 2 years ago
- an implementation of "code2vec: Learning Distributed Representations of Code"☆30Updated 11 months ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆16Updated 2 years ago
- Neural Code Translator provides instructions, datasets, and a deep learning infrastructure (based on seq2seq) that aims at learning code …☆38Updated 6 years ago
- ☆18Updated 2 years ago
- ☆13Updated 3 years ago
- BugsInPy: Benchmarking Bugs in Python Projects☆104Updated 11 months ago
- Improving Machine Translation Systems via Isotopic Replacement☆12Updated 2 years ago
- ☆36Updated 3 years ago
- Web queries dataset for code search☆32Updated 2 years ago
- TDCleaner: A Tool for Detecting Obsolete TODO Comments in Software Repos☆11Updated 3 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆53Updated 3 years ago
- RepairAgent is an autonomous LLM-based agent for software repair.☆47Updated 2 weeks ago