frankxu2004 / gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆17 Updated 3 years ago
Alternatives and similar repositories for gpt-neox
Users who are interested in gpt-neox are comparing it to the libraries listed below.
- ☆14 Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. ☆168 Updated 3 weeks ago
- Downloads 2020 English Wikipedia articles as plaintext ☆25 Updated 2 years ago
- MozoLM: A language model (LM) serving library ☆45 Updated this week
- SLIDE (Sub-LInear Deep learning Engine) written in Go ☆45 Updated 5 years ago
- ☆52 Updated 5 months ago
- Evaluation suite for large-scale language models. ☆127 Updated 3 years ago
- ☆44 Updated last year
- ☆26 Updated last year
- ☆21 Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming. ☆19 Updated last year
- Scripts to parse arxiv documents for NLP tasks ☆18 Updated 2 years ago
- Repository for analysis and experiments in the BigCode project. ☆121 Updated last year
- Script for downloading GitHub. ☆96 Updated last year
- ☆26 Updated 2 years ago
- Minimal library to train LLMs on TPU in JAX with pjit(). ☆293 Updated last year
- This repository contains all the code for collecting large scale amounts of code from GitHub. ☆110 Updated 2 years ago
- a list of StrongAI related resources. ☆11 Updated 2 years ago
- ☆21 Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models ☆83 Updated last year
- Web queries dataset for code search ☆32 Updated 2 years ago
- Generative model for code infilling and synthesis ☆304 Updated last year
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research ☆16 Updated last year
- Pre-training code for CrystalCoder 7B LLM ☆55 Updated last year
- benchmarking some transformer deployments ☆26 Updated 2 years ago
- Official code release for the paper Coder Reviewer Reranking for Code Generation. ☆45 Updated 2 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models" ☆30 Updated 2 years ago
- ☆42 Updated 7 months ago
- 3rd party dependencies for DALI project ☆10 Updated last week
- Semantic Code Search ☆36 Updated 2 years ago