☆131Jun 9, 2022Updated 3 years ago
Alternatives and similar repositories for minimal-gpt-neox-20b
Users that are interested in minimal-gpt-neox-20b are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆67Aug 24, 2022Updated 3 years ago
- ☆78Dec 7, 2023Updated 2 years ago
- One stop shop for all things carp☆59Sep 9, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,404Feb 3, 2026Updated last month
- #인권코퍼스☆31Oct 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Used for adaptive human in the loop evaluation of language and embedding models.☆307Mar 1, 2023Updated 3 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Aug 11, 2022Updated 3 years ago
- DEPRECATED--all functionality moved to nbdev☆15Aug 3, 2022Updated 3 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…☆22Nov 14, 2022Updated 3 years ago
- ☆15Jun 10, 2022Updated 3 years ago
- ☆111Aug 5, 2021Updated 4 years ago
- annotated-transformer-kr☆15May 16, 2019Updated 6 years ago
- Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리☆18Jan 3, 2024Updated 2 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- ☆14Oct 17, 2023Updated 2 years ago
- ☆32Sep 27, 2021Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- 매주 목요일, 20:00 모임☆16Jul 24, 2020Updated 5 years ago
- KoGPT2 on Huggingface Transformers☆33May 4, 2021Updated 4 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆203Nov 12, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆21Nov 28, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆138Aug 2, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- Easy installer of kocohub dataset☆24May 31, 2020Updated 5 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 5 months ago
- ☆11Oct 3, 2021Updated 4 years ago
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Dec 9, 2022Updated 3 years ago
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- Calculating Expected Time for training LLM.☆38Apr 17, 2023Updated 2 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74May 16, 2022Updated 3 years ago
- ☆33Aug 30, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 초성 해석기 based on ko-BART☆29Mar 31, 2021Updated 4 years ago
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 3 years ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20May 30, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆60Apr 9, 2024Updated last year
- ☆19Sep 20, 2022Updated 3 years ago