EleutherAI / github-downloader
Script for downloading GitHub.
☆91Updated 8 months ago
Alternatives and similar repositories for github-downloader:
Users that are interested in github-downloader are comparing it to the libraries listed below
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- Repository for analysis and experiments in the BigCode project.☆117Updated last year
- ☆77Updated last year
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆107Updated 2 years ago
- ☆89Updated 2 years ago
- ☆29Updated last year
- ☆51Updated 2 weeks ago
- ☆74Updated last year
- Accepted by Transactions on Machine Learning Research (TMLR)☆126Updated 5 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆47Updated last year
- An experimental implementation of the retrieval-enhanced language model☆74Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- ☆97Updated 2 years ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆44Updated 2 months ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119Updated 2 years ago
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆78Updated 6 months ago
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so…☆55Updated last year
- Open Implementations of LLM Analyses☆102Updated 5 months ago
- Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing☆26Updated 3 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆179Updated 2 years ago
- Experiments with generating opensource language model assistants☆97Updated last year
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆85Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆103Updated last year
- ☆42Updated last month
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆147Updated 7 months ago
- Downloads 2020 English Wikipedia articles as plaintext☆23Updated last year
- Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension".☆41Updated last year
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆71Updated last year
- ☆147Updated 4 years ago