☆131Jun 9, 2022Updated 3 years ago
Alternatives and similar repositories for minimal-gpt-neox-20b
Users that are interested in minimal-gpt-neox-20b are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆67Aug 24, 2022Updated 3 years ago
- ☆78Dec 7, 2023Updated 2 years ago
- One stop shop for all things carp☆59Sep 9, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,416Updated this week
- #인권코퍼스☆31Oct 6, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Used for adaptive human in the loop evaluation of language and embedding models.☆307Mar 1, 2023Updated 3 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Aug 11, 2022Updated 3 years ago
- DEPRECATED--all functionality moved to nbdev☆15Aug 3, 2022Updated 3 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…☆22Nov 14, 2022Updated 3 years ago
- ☆15Jun 10, 2022Updated 3 years ago
- ☆110Aug 5, 2021Updated 4 years ago
- annotated-transformer-kr☆15May 16, 2019Updated 6 years ago
- Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리☆17Jan 3, 2024Updated 2 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- ☆14Oct 17, 2023Updated 2 years ago
- ☆32Sep 27, 2021Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- 매주 목요일, 20:00 모임☆16Jul 24, 2020Updated 5 years ago
- KoGPT2 on Huggingface Transformers☆33May 4, 2021Updated 4 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆203Nov 12, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆21Nov 28, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆138Aug 2, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- Easy installer of kocohub dataset☆24May 31, 2020Updated 5 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 6 months ago
- ☆11Oct 3, 2021Updated 4 years ago
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Dec 9, 2022Updated 3 years ago
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- Calculating Expected Time for training LLM.☆39Apr 17, 2023Updated 3 years ago
- ☆33Aug 30, 2023Updated 2 years ago
- 초성 해석기 based on ko-BART☆29Mar 31, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20May 30, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- ☆20Oct 22, 2021Updated 4 years ago
- Code for NeurIPS LLM Efficiency Challenge☆60Apr 9, 2024Updated 2 years ago
- ☆19Sep 20, 2022Updated 3 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago