☆131Jun 9, 2022Updated 3 years ago
Alternatives and similar repositories for minimal-gpt-neox-20b
Users who are interested in minimal-gpt-neox-20b are comparing it to the libraries listed below.
- ☆67Aug 24, 2022Updated 3 years ago
- ☆78Dec 7, 2023Updated 2 years ago
- One stop shop for all things carp☆58Sep 9, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,437May 19, 2026Updated 2 weeks ago
- #HumanRightsCorpus☆31Oct 6, 2023Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆306Mar 1, 2023Updated 3 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Aug 11, 2022Updated 3 years ago
- DEPRECATED--all functionality moved to nbdev☆15Aug 3, 2022Updated 3 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…☆22Nov 14, 2022Updated 3 years ago
- ☆15Jun 10, 2022Updated 3 years ago
- ☆110Aug 5, 2021Updated 4 years ago
- annotated-transformer-kr☆15May 16, 2019Updated 7 years ago
- Adapts Google's official ROUGE implementation for use with Korean☆17Jan 3, 2024Updated 2 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- ☆15Oct 17, 2023Updated 2 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- Meetup every Thursday at 20:00☆16Jul 24, 2020Updated 5 years ago
- KoGPT2 on Huggingface Transformers☆33May 4, 2021Updated 5 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆204Nov 12, 2022Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆21Nov 28, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆139Aug 2, 2023Updated 2 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- Easy installer of kocohub dataset☆24May 31, 2020Updated 6 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 7 months ago
- ☆11Oct 3, 2021Updated 4 years ago
- This repository is forked from ParlAI. A Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Dec 9, 2022Updated 3 years ago
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- Calculating Expected Time for training LLM.☆39Apr 17, 2023Updated 3 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74May 16, 2022Updated 4 years ago
- ☆33Aug 30, 2023Updated 2 years ago
- Choseong (Korean initial consonant) interpreter based on ko-BART☆29Mar 31, 2021Updated 5 years ago
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 4 years ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20May 30, 2023Updated 3 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆61Apr 9, 2024Updated 2 years ago
- ☆19Sep 20, 2022Updated 3 years ago
- ☆35Apr 8, 2023Updated 3 years ago