☆131Jun 9, 2022Updated 4 years ago
Alternatives and similar repositories for minimal-gpt-neox-20b
Users that are interested in minimal-gpt-neox-20b are comparing it to the libraries listed below.
- ☆67Aug 24, 2022Updated 3 years ago
- ☆79Dec 7, 2023Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,443Jun 11, 2026Updated 2 weeks ago
- Human rights corpus (#인권코퍼스)☆31Oct 6, 2023Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆306Mar 1, 2023Updated 3 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Aug 11, 2022Updated 3 years ago
- DEPRECATED--all functionality moved to nbdev☆15Aug 3, 2022Updated 3 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…☆22Nov 14, 2022Updated 3 years ago
- ☆15Jun 10, 2022Updated 4 years ago
- ☆110Aug 5, 2021Updated 4 years ago
- annotated-transformer-kr☆15May 16, 2019Updated 7 years ago
- Google's official ROUGE implementation, adapted for use with Korean☆17Jan 3, 2024Updated 2 years ago
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- ☆15Oct 17, 2023Updated 2 years ago
- ☆32Sep 27, 2021Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- Meetup every Thursday at 20:00☆16Jul 24, 2020Updated 5 years ago
- KoGPT2 on Huggingface Transformers☆33May 4, 2021Updated 5 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆204Nov 12, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆21Nov 28, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆139Aug 2, 2023Updated 2 years ago
- CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training☆33Jul 20, 2022Updated 3 years ago
- Easy installer of kocohub dataset☆24May 31, 2020Updated 6 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 8 months ago
- ☆11Oct 3, 2021Updated 4 years ago
- ☆127Mar 20, 2026Updated 3 months ago
- This repository is forked from ParlAI. A Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Dec 9, 2022Updated 3 years ago
- Long-context pretrained encoder-decoder models☆97Oct 28, 2022Updated 3 years ago
- Calculating Expected Time for training LLM.☆39Apr 17, 2023Updated 3 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74May 16, 2022Updated 4 years ago
- ☆33Aug 30, 2023Updated 2 years ago
- Korean initial-consonant (choseong) interpreter based on ko-BART☆29Mar 31, 2021Updated 5 years ago
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 4 years ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20May 30, 2023Updated 3 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆19Jan 31, 2025Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆61Apr 9, 2024Updated 2 years ago
- ☆19Sep 20, 2022Updated 3 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 6 years ago