Simple Annotated implementation of GPT-NeoX in PyTorch
☆110Aug 11, 2022Updated 3 years ago
Alternatives and similar repositories for neox
Users that are interested in neox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Differentiable FFT Conv Layer with Dense Color Channels☆11Apr 8, 2022Updated 4 years ago
- ☆131Jun 9, 2022Updated 3 years ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆204Nov 12, 2022Updated 3 years ago
- Implementation of the GLOM model for text☆11Mar 4, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆34Aug 10, 2021Updated 4 years ago
- A GPT, made only of MLPs, in Jax☆59Jun 23, 2021Updated 4 years ago
- API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend☆335Oct 25, 2021Updated 4 years ago
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 3 years ago
- An OpenAI chatbot for Rocket.Chat written in Go☆14Aug 10, 2023Updated 2 years ago
- A CLIP conditioned Decision Transformer.☆22Jul 14, 2021Updated 4 years ago
- Repo for fine-tuning Casual LLMs☆465Mar 27, 2024Updated 2 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago
- My solution to the SIIM-ACR Pneumothorax Segmentation Challenge on Kaggle, which got the 7th place.☆20Dec 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆12Nov 6, 2022Updated 3 years ago
- ☆110Aug 5, 2021Updated 4 years ago
- Lightweight GANを用いてラグナロクオンラインのキャラクター画像を生成するGAN☆12May 13, 2021Updated 5 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,437May 19, 2026Updated 2 weeks ago
- ☆78Dec 7, 2023Updated 2 years ago
- ☆31Mar 8, 2021Updated 5 years ago
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆50Jan 4, 2023Updated 3 years ago
- ☆14Dec 28, 2021Updated 4 years ago
- This repository will be a summary and outlook on all our open, medical, AI advancements.☆30Feb 24, 2023Updated 3 years ago
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆178Jul 27, 2022Updated 3 years ago
- CPM的Transformer版☆17Nov 27, 2022Updated 3 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- A white hat doppelganger application designed to look like Windows Event Viewer. The intended use of this is scam baiting.☆13May 27, 2021Updated 5 years ago
- Train vision models using JAX and 🤗 transformers☆102Dec 14, 2025Updated 5 months ago
- Content addressable graph where every node has at most a single link to another node☆20Nov 4, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Feb 18, 2023Updated 3 years ago
- Notebook for running GPT neo models based on GPT3☆61Aug 10, 2021Updated 4 years ago
- ☆216Oct 10, 2022Updated 3 years ago
- The Happy Faces Benchmark☆15Jul 20, 2023Updated 2 years ago
- Protect your API tagged routes with a simple set of known keys. Useful for locking down useage from API managers☆13Apr 3, 2018Updated 8 years ago
- Python implementation of the sparse clustering methods☆28Oct 20, 2022Updated 3 years ago
- GAN-based method to create counterfactual explanations for chest X-rays☆25Oct 27, 2025Updated 7 months ago