frankxu2004 / gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆17Updated 2 years ago
Alternatives and similar repositories for gpt-neox:
Users that are interested in gpt-neox are comparing it to the libraries listed below
- ☆44Updated 10 months ago
- ☆51Updated 3 weeks ago
- Official code release for the paper Coder Reviewer Reranking for Code Generation.☆42Updated 2 years ago
- ☆75Updated last week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated 2 years ago
- ☆14Updated last year
- Background materials for the article "Productivity Assessment of Neural Code Completion"☆12Updated last year
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆87Updated 2 years ago
- Semantic Code Search☆35Updated 2 years ago
- Repository for analysis and experiments in the BigCode project.☆117Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆33Updated 2 years ago
- Script for downloading GitHub.☆91Updated 9 months ago
- ☆26Updated 2 years ago
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)☆58Updated 3 years ago
- ☆30Updated last year
- A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆13Updated 2 years ago
- ☆37Updated 2 years ago
- This is the repository for the paper Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descripti…☆25Updated 2 years ago
- Accepted by Transactions on Machine Learning Research (TMLR)☆126Updated 5 months ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- ☆13Updated 4 months ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Updated 4 years ago
- Fault-aware neural code rankers☆28Updated 2 years ago
- We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This datase…☆38Updated 2 years ago
- PROSE Public Benchmark Suite☆24Updated 6 months ago
- One stop shop for all things carp☆59Updated 2 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆52Updated last year
- Code for the paper "Efficient Training of Language Models to Fill in the Middle"☆176Updated 2 years ago