EleutherAI / polyglotLinks
Polyglot: Large Language Models of Well-balanced Competence in Multi-languages
☆484Updated 2 years ago
Alternatives and similar repositories for polyglot
Users that are interested in polyglot are comparing it to the libraries listed below
Sorting:
- ☆197Updated 2 years ago
- Korean Multi-task Instruction Tuning☆156Updated 2 years ago
- ☆107Updated 2 years ago
- Data processing system for polyglot☆93Updated 2 years ago
- Large-scale language modeling tutorials with PyTorch☆291Updated 4 years ago
- Korean Sentence Embedding Repository☆211Updated last year
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆73Updated 5 months ago
- KoCLIP: Korean port of OpenAI CLIP, in Flax☆154Updated last month
- IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델☆69Updated 2 years ago
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Updated last year
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Updated 3 years ago
- Benchmark in Korean Context☆136Updated 2 years ago
- List of Korean pre-trained language models.☆189Updated 2 years ago
- ChatGPT의 RLHF를 학습을 위한 3가지 step별 한국어 데이터셋☆40Updated 2 years ago
- 🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드☆59Updated 2 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆249Updated 2 years ago
- ☆123Updated 2 years ago
- Open Korean NLP Dataset Curation for the Users All Around the Globe☆152Updated 2 years ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆82Updated last year
- Curation note of NLP datasets☆98Updated 3 years ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆200Updated 2 years ago
- Pretrained Language Models for Korean☆399Updated 3 years ago
- Pecab: Pure python Korean morpheme analyzer based on Mecab☆172Updated last year
- ☆147Updated 3 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆309Updated 2 years ago
- Korean Math Word Problems☆59Updated 4 years ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆450Updated 9 months ago
- Simple Chit-Chat based on KoGPT2☆182Updated 2 years ago
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆101Updated 3 years ago
- bpe based korean t5 model for text-to-text unified framework☆63Updated last year