An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆13Jun 7, 2023Updated 3 years ago
Alternatives and similar repositories for gpt-neox
Users that are interested in gpt-neox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.☆55Oct 29, 2025Updated 8 months ago
- French to English translator on character level implemented by Keras☆10Jun 15, 2017Updated 9 years ago
- ☆13Aug 28, 2018Updated 7 years ago
- Sentencepiece based BPE tokenizer for English and Japanese language text.☆29Apr 4, 2024Updated 2 years ago
- Japanese instruction data (日本語指示データ)☆24Jul 13, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆21Nov 28, 2022Updated 3 years ago
- ☆39Jan 27, 2026Updated 5 months ago
- ☆11Dec 13, 2023Updated 2 years ago
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- A magic notepad. δ☆14May 21, 2023Updated 3 years ago
- pix2pix keras implemantation by tommy☆15Dec 26, 2017Updated 8 years ago
- ☆89Jul 25, 2023Updated 2 years ago
- Regedit but with Mica, GlowUI, Search, Tabs, History, Editing and more☆10Apr 23, 2022Updated 4 years ago
- 青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット☆22Jan 17, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Sep 18, 2025Updated 9 months ago
- High-performance computing using Kavli IPMU clusters (specifically idark).☆13Apr 20, 2023Updated 3 years ago
- Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"☆11Dec 17, 2024Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data.