EleutherAI / github-downloaderLinks

Script for downloading GitHub.

☆96

Alternatives and similar repositories for github-downloader

Users that are interested in github-downloader are comparing it to the libraries listed below

Sorting:

bigcode-project / bigcode-analysis
Repository for analysis and experiments in the BigCode project.
☆121Updated last year
EleutherAI / stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
☆83Updated last year
CarperAI / Code-Pile
This repository contains all the code for collecting large scale amounts of code from GitHub.
☆110Updated 2 years ago
EleutherAI / openwebtext2
☆90Updated 3 years ago
shuyanzhou / docprompting
Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
☆248Updated last year
dpfried / incoder
Generative model for code infilling and synthesis
☆304Updated last year
Zyq-scut / RLTF
Accepted by Transactions on Machine Learning Research (TMLR)
☆130Updated 10 months ago
openai / human-eval-infilling
Code for the paper "Efficient Training of Language Models to Fill in the Middle"
☆183Updated 2 years ago
EleutherAI / lm_perplexity
☆153Updated 4 years ago
leogao2 / lm_dataformat
☆79Updated last year
shrivastavadisha / repo_level_prompt_generation
☆124Updated 2 years ago
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago
neulab / code-bert-score
CodeBERTScore: an automatic metric for code generation, based on BERTScore
☆196Updated last year
zorazrw / odex
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆48Updated last year
google-research / babelcode
☆52Updated 5 months ago
LAION-AI / Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
☆208Updated last year
nyu-mll / ILF-for-code-generation
☆78Updated 4 months ago
AI21Labs / lm-evaluation
Evaluation suite for large-scale language models.
☆127Updated 3 years ago
explodinggradients / nemesis
Reward Model framework for LLM RLHF
☆61Updated 2 years ago
bigcode-project / bigcode-encoder
☆30Updated 2 years ago
terryyz / ice-score
[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
☆76Updated last year
salesforce / jaxformer
Minimal library to train LLMs on TPU in JAX with pjit().
☆292Updated last year
Langboat / mengzi-retrieval-lm
An experimental implementation of the retrieval-enhanced language model
☆75Updated 2 years ago
ntunlp / xCodeEval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
☆86Updated 10 months ago
reddy-lab-code-research / StructCoder
Code for "StructCoder: Structure-Aware Transformer for Code Generation"
☆76Updated last year
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆55Updated last year
CarperAI / InstructGPT
For experiments involving instruct gpt. Currently used for documenting open research questions.
☆71Updated 2 years ago
google-research / longt5
☆182Updated 2 years ago
young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆117Updated 2 years ago
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆64Updated 2 years ago