Simple Annotated implementation of GPT-NeoX in PyTorch
☆110Aug 11, 2022Updated 3 years ago
Alternatives and similar repositories for neox
Users that are interested in neox are comparing it to the libraries listed below
Sorting:
- Experiments dashboard for LabML☆17Dec 11, 2022Updated 3 years ago
- ☆34Aug 10, 2021Updated 4 years ago
- ☆131Jun 9, 2022Updated 3 years ago
- Notebooks and tests for 🤗 Diffusers library☆10Aug 6, 2023Updated 2 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆203Nov 12, 2022Updated 3 years ago
- Differentiable FFT Conv Layer with Dense Color Channels☆11Apr 8, 2022Updated 3 years ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 2 years ago
- Implementation of the GLOM model for text☆11Mar 4, 2021Updated 5 years ago
- ☆50Jan 4, 2023Updated 3 years ago
- ☆13Dec 19, 2018Updated 7 years ago
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆13Nov 6, 2022Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Feb 17, 2023Updated 3 years ago
- Making a bridge between NLP models and Brain data☆19Jun 3, 2020Updated 5 years ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- CPM的Transformer版☆17Nov 27, 2022Updated 3 years ago
- (unofficial) - customized fork of DETR, optimized for intelligent obj detection on 'real world' custom datasets☆12Aug 22, 2020Updated 5 years ago
- ☆24Dec 11, 2024Updated last year
- Here is an implementation of DeepLabv3+ in PyTorch(1.7). It supports many backbones and datasets.☆17Nov 14, 2022Updated 3 years ago
- https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/☆19Oct 20, 2019Updated 6 years ago
- Dolores is a Python library designed to improve the developer experience when working with pretrained language models. Dolores provides p…☆34Jul 30, 2020Updated 5 years ago
- A text generation Transformer model trained on Reddit posts.☆16Jan 5, 2023Updated 3 years ago
- API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend☆335Oct 25, 2021Updated 4 years ago
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆178Jul 27, 2022Updated 3 years ago
- This project is used to generate a blog post using Natural Language processing, Hugging Face Transformers and GPT-2 Model.☆17May 24, 2021Updated 4 years ago
- FastAI Model Interpretation with LIME☆22Feb 7, 2019Updated 7 years ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆434Jun 14, 2023Updated 2 years ago
- ☆21Mar 31, 2022Updated 3 years ago
- ☆21Mar 15, 2023Updated 2 years ago
- A port of muP to JAX/Haiku☆25Oct 23, 2022Updated 3 years ago
- Stable Diffusion Video to Video, Image to Image, Template Prompt Generation system and more, for use with any stable diffusion model☆23Sep 14, 2022Updated 3 years ago
- Autoregressive transformer in JAX from scratch☆23Jan 28, 2022Updated 4 years ago
- Hyperparameter: The High-Performance Configuration Library for AI Systems☆22Dec 14, 2025Updated 2 months ago
- FastAPI for Triton☆18Jun 11, 2022Updated 3 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow.☆431Feb 12, 2022Updated 4 years ago
- Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can gener…☆205Sep 14, 2022Updated 3 years ago
- Starter code for the CellSignal NeurIPS 2019 competition.☆45Jun 6, 2025Updated 9 months ago