EleutherAI/polyglot

Polyglot: Large Language Models of Well-balanced Competence in Multi-languages

☆487

Alternatives and similar repositories for polyglot

Users who are interested in polyglot are comparing it to the repositories listed below.

Beomi / KoAlpaca
KoAlpaca: An open-source language model that understands Korean instructions
☆1,574 · Oct 25, 2024 · Updated last year
sooftware / Korean-PLM
List of Korean pre-trained language models.
☆189 · Aug 31, 2023 · Updated 2 years ago
jason9693 / oslo-kogpt-finetunig
An example of fine-tuning KoGPT with OSLO.
☆23 · Aug 26, 2022 · Updated 3 years ago
lassl / lassl
Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets
☆130 · Nov 12, 2022 · Updated 3 years ago
tunib-ai / large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
☆293 · Nov 2, 2021 · Updated 4 years ago
HeegyuKim / open-korean-instructions
A collection of public Korean instruction datasets for training language models.
☆469 · Apr 13, 2025 · Updated last year
melodysdreamj / KoVicuna
☆122 · Apr 21, 2023 · Updated 3 years ago
EleutherAI / polyglot-data
data related codebase for polyglot project
☆19 · Mar 30, 2023 · Updated 3 years ago
LG-NLP / KorWikiTableQuestions
This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…
☆91 · Oct 22, 2024 · Updated last year
nlpai-lab / KULLM
☁️ Gureum (KULLM): an LLM specialized for Korean, developed by Korea University
☆588 · May 1, 2024 · Updated 2 years ago
EleutherAI / oslo
OSLO: Open Source for Large-scale Optimization
☆175 · Sep 9, 2023 · Updated 2 years ago
kakaobrain / kogpt
KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)
☆1,011 · Jan 30, 2024 · Updated 2 years ago
jason9693 / polyglot-finetuning-oslo
☆19 · Sep 20, 2022 · Updated 3 years ago
lbox-kr / lbox-open
☆108 · Apr 11, 2025 · Updated last year
Beomi / easy-lm-trainer
🤗 Sample code for training an LM with minimal setup
☆59 · May 23, 2023 · Updated 3 years ago
naver-ai / korean-safety-benchmarks
Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)
☆252 · Jun 29, 2023 · Updated 3 years ago
kakaobrain / kortok
The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)
☆119 · Oct 8, 2020 · Updated 5 years ago
AIRC-KETI / ke-t5
☆198 · May 22, 2023 · Updated 3 years ago
jason9693 / APEACH
APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets
☆78 · Feb 5, 2023 · Updated 3 years ago
airobotlab / KoChatGPT
Korean datasets for each of the three steps of ChatGPT-style RLHF training
☆42 · Nov 21, 2023 · Updated 2 years ago
monologg / KoELECTRA
Pretrained ELECTRA Model for Korean
☆637 · Feb 19, 2024 · Updated 2 years ago
krafton-ai / KORani
☆108 · May 8, 2023 · Updated 3 years ago
tunib-ai / DKTC
Dataset of Korean Threatening Conversations
☆74 · Nov 1, 2022 · Updated 3 years ago
tunib-ai / tunib-electra
Korean-English Bilingual Electra Models
☆110 · Nov 22, 2021 · Updated 4 years ago
KLUE-benchmark / KLUE
📖 Korean NLU Benchmark
☆601 · Jun 30, 2026 · Updated 2 weeks ago
hyunwoongko / python-mecab-kor
Yet another python binding for mecab-ko
☆88 · May 16, 2023 · Updated 3 years ago
EleutherAI / dps
Data processing system for polyglot
☆93 · Jul 6, 2026 · Updated 2 weeks ago
tunib-ai / oslo
OSLO: Open Source framework for Large-scale model Optimization
☆309 · Aug 25, 2022 · Updated 3 years ago
monologg / KoBigBird
🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)
☆202 · Dec 28, 2023 · Updated 2 years ago
hyunwoongko / kss
KSS: Korean String processing Suite
☆471 · Nov 13, 2025 · Updated 8 months ago
ko-nlp / Open-korean-corpora
Open Korean NLP Dataset Curation for the Users All Around the Globe
☆158 · Jun 17, 2026 · Updated last month
hyunwoongko / pecab
Pecab: Pure python Korean morpheme analyzer based on Mecab
☆172 · Apr 27, 2024 · Updated 2 years ago
songys / AwesomeKorean_Data
Links to Korean datasets
☆920 · Jun 17, 2026 · Updated last month
smilegate-ai / HuLiC
☆93 · Mar 3, 2022 · Updated 4 years ago
MrBananaHuman / PangyoCorpora
☆36 · Oct 4, 2023 · Updated 2 years ago
lovit / KoBERTScore
BERTScore for Korean
☆81 · Feb 22, 2024 · Updated 2 years ago
wisdomify / wisdomify
A BERT-based reverse dictionary of Korean proverbs
☆97 · Feb 28, 2023 · Updated 3 years ago
kakaobrain / kor-nlu-datasets
KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
☆314 · Jul 9, 2023 · Updated 3 years ago
BM-K / Sentence-Embedding-Is-All-You-Need
Korean Sentence Embedding Repository
☆214 · Dec 1, 2024 · Updated last year