An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆17 · Jul 1, 2022 · Updated 3 years ago
Alternatives and similar repositories for gpt-neox
Users that are interested in gpt-neox are comparing it to the libraries listed below.
- ☆14 · Aug 29, 2023 · Updated 2 years ago
- This is the code for the paper Statistical Recurrent Models on Manifold valued Data ☆17 · May 16, 2021 · Updated 5 years ago
- ☆14 · Mar 9, 2023 · Updated 3 years ago
- ACO - A protocol for decentralized options ☆21 · Jan 24, 2023 · Updated 3 years ago
- PyTorch implementation of Group Normalization https://arxiv.org/abs/1803.08494 ☆13 · Mar 23, 2018 · Updated 8 years ago
- repository for R library "sbrlmod" ☆26 · May 5, 2024 · Updated 2 years ago
- Hierarchical Encoder Decoder for Dialog Modelling ☆16 · May 20, 2015 · Updated 11 years ago
- Training Neural Networks Without Gradients: A Scalable ADMM Approach python implement ☆14 · Jun 20, 2017 · Updated 9 years ago
- shapefile to geojson and topojson ☆20 · Apr 29, 2014 · Updated 12 years ago
- Testing KAN-based text generation GPT models ☆19 · May 6, 2024 · Updated 2 years ago
- tf.keras implementation for OpenAI GPT 2 ☆15 · Jul 6, 2019 · Updated 6 years ago
- Spoken Language Understanding(SLU)/Slot Filling(语义槽填充) in PyTorch ☆14 · May 22, 2018 · Updated 8 years ago
- A toy programming language, syntax based on C. ☆11 · Mar 8, 2020 · Updated 6 years ago
- Rotate RoI Align and Rotate Position Sensitive RoI Align Operation in Caffe ☆15 · Dec 5, 2018 · Updated 7 years ago
- [ICML2025 Oral] LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently ☆32 · Oct 22, 2025 · Updated 8 months ago
- This repository contains the dataset of our ISSTA 2018 paper: An Empirical Study on TensorFlow Program Bugs. ☆29 · May 20, 2020 · Updated 6 years ago
- Attempt at reproducing a SGNN's projection layer, but with word n-grams instead of skip-grams. Paper and more: http://aclweb.org/antholog… ☆22 · Nov 6, 2022 · Updated 3 years ago
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc. ☆12 · Oct 12, 2018 · Updated 7 years ago
- Intuitive graphical representation of source code ☆14 · Mar 15, 2023 · Updated 3 years ago
- Local LLM Web search using qwen model and Ollama ☆15 · Feb 9, 2024 · Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. ☆21 · Nov 28, 2022 · Updated 3 years ago
- A web application built using gradio for classification of flower images ☆10 · Apr 21, 2021 · Updated 5 years ago
- Manufacturing specifications ☆25 · Jun 6, 2022 · Updated 4 years ago
- A simple recommendation engine (by way of convolutions and embeddings) written in TensorFlow ☆20 · Jun 5, 2017 · Updated 9 years ago
- GPT Code Descriptor and Markdown Creator ☆14 · Apr 18, 2021 · Updated 5 years ago
- Very simple language implemented using antlr for beginners ☆17 · Jun 27, 2018 · Updated 8 years ago
- ☆10 · Apr 30, 2024 · Updated 2 years ago
- MagickCache is a secure, high-performance caching tool for images, videos, audio, and metadata. It uses memory mapping for fast access, s… ☆19 · Updated this week
- caffe implement ☆22 · Apr 9, 2021 · Updated 5 years ago
- fork of x ROS wrapper for collaborative decentralized visual-inertial odometry ☆12 · Jun 30, 2023 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆13 · Nov 27, 2023 · Updated 2 years ago
- Learn CPython internals by customizing the interpreter. ☆18 · Feb 8, 2026 · Updated 4 months ago
- A quantum video game! ☆34 · Jan 18, 2019 · Updated 7 years ago
- An experimental implementation of the retrieval-enhanced language model ☆74 · Dec 29, 2022 · Updated 3 years ago
- Bazel plugin for the asdf version manager ☆11 · Aug 10, 2023 · Updated 2 years ago
- 3rd party dependencies for DALI project ☆11 · Jun 11, 2026 · Updated 2 weeks ago
- ☆11 · Nov 14, 2022 · Updated 3 years ago
- ☆10 · Nov 17, 2023 · Updated 2 years ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration. ☆14 · Sep 15, 2023 · Updated 2 years ago