ENOT-AutoDL / gpt-j-6B-tensorrt-int8
GPT-J 6B inference on TensorRT with INT8 precision
☆11 · Updated last year
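The listing itself carries no usage snippet, so below is only a minimal, hypothetical sketch of what building an INT8 TensorRT engine for GPT-J typically involves; the ONNX file name, the FP16 fallback flag, and `MyCalibrator` are assumptions, not this repository's actual pipeline.

```python
# Hypothetical sketch of INT8 engine building with the TensorRT Python API.
# "gptj.onnx" and MyCalibrator are placeholders; the repository's actual
# export and quantization pipeline may differ.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)

with open("gptj.onnx", "rb") as f:  # placeholder ONNX export of GPT-J
    if not parser.parse(f.read()):
        raise RuntimeError("ONNX parse failed")

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.INT8)  # request INT8 kernels
config.set_flag(trt.BuilderFlag.FP16)  # allow FP16 fallback for unsupported layers
# Post-training INT8 needs calibration data unless the graph already carries
# Q/DQ nodes; MyCalibrator would subclass trt.IInt8EntropyCalibrator2.
# config.int8_calibrator = MyCalibrator()

engine_bytes = builder.build_serialized_network(network, config)
with open("gptj_int8.plan", "wb") as f:
    f.write(engine_bytes)
```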
Alternatives and similar repositories for gpt-j-6B-tensorrt-int8:
Users interested in gpt-j-6B-tensorrt-int8 are comparing it to the libraries listed below:
- sigma-MoE layer (☆18, updated last year)
- Truly flash T5 implementation! (☆63, updated 9 months ago)
- High-performance PyTorch modules (☆18, updated 2 years ago)
- My explorations into editing the knowledge and memories of an attention network (☆34, updated 2 years ago)
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things. (☆13, updated 3 years ago)
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… (☆34, updated last year)
- Implements the SM3-II adaptive optimization algorithm for PyTorch. (☆33, updated 5 months ago)
- A boilerplate to use multiprocessing for your gRPC server in your Python project (☆25, updated 3 years ago)
- Index of URLs to PDF files all over the internet and scripts (☆21, updated last year)
- A dashboard for exploring timm learning rate schedulers (☆19, updated 2 months ago)
- C++ mosestokenizer (☆17, updated 11 months ago)
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch (☆59, updated 4 years ago)
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in PyTorch (☆72, updated 2 years ago)
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training (☆26, updated 2 years ago)
- (☆57, updated last year)
- This repository contains example code to build models on TPUs (☆30, updated 2 years ago)
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" (☆36, updated last year)
- [JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion (☆40, updated 3 years ago)
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI (☆57, updated last year)
- Memory-efficient transformer. Work in progress. (☆19, updated 2 years ago)
- A fast implementation of T5/UL2 in PyTorch using Flash Attention (☆82, updated 3 weeks ago)
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling" (☆15, updated 3 months ago)
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data (☆21, updated 6 months ago)
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… (☆25, updated 10 months ago)
- (☆18, updated 8 months ago)
- Various transformers for FSDP research (☆36, updated 2 years ago)
- (☆13, updated 6 months ago)
- TorchServe + Streamlit for easily serving your HuggingFace NER models (☆32, updated 2 years ago)
- Experiments for XLM-V Transformers Integration (☆13, updated 2 years ago)
- Checkpointable dataset utilities for foundation model training (☆32, updated last year)