frankxu2004 / gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆17 Updated 2 years ago
Alternatives and similar repositories for gpt-neox
Users who are interested in gpt-neox compare it to the repositories listed below.
- ☆14 Updated last year
- ☆52 Updated 2 months ago
- ☆44 Updated 11 months ago
- Distill ChatGPT's coding ability into a small model (1B) ☆29 Updated last year
- Repository for analysis and experiments in the BigCode project. ☆118 Updated last year
- PyTorch library for synthesizing programs from natural language ☆18 Updated 9 months ago
- ☆75 Updated last month
- ☆16 Updated 5 years ago
- This repository contains all the code for collecting large scale amounts of code from GitHub. ☆108 Updated 2 years ago
- ☆78 Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- URL downloader supporting checkpointing and continuous checksumming. ☆19 Updated last year
- A Python implementation of Toolformer using Huggingface Transformers ☆14 Updated 2 years ago
- ☆30 Updated last year
- Codebase for plCoP, a Prolog Technology Reinforcement Learning Prover ☆12 Updated 4 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆48 Updated last year
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 Updated 9 months ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts. ☆14 Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆18 Updated 2 years ago
- Implementation of a Tensorflow XLA rematerialization pass ☆15 Updated 5 years ago
- Evaluation suite for large-scale language models. ☆125 Updated 3 years ago
- Reasoning by Communicating with Agents ☆28 Updated 2 weeks ago
- Fault-aware neural code rankers ☆28 Updated 2 years ago
- Semantic Code Search ☆35 Updated 2 years ago
- Code for generating the JuICe dataset. ☆37 Updated 3 years ago
- A collection of models built with ColossalAI ☆32 Updated 2 years ago
- ☆28 Updated 3 years ago
- Script for downloading GitHub. ☆93 Updated 10 months ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models" ☆29 Updated 2 years ago
- ☆26 Updated 2 years ago