EleutherAI / polyglotLinks
Polyglot: Large Language Models of Well-balanced Competence in Multi-languages
☆485Updated 2 years ago
Alternatives and similar repositories for polyglot
Users that are interested in polyglot are comparing it to the libraries listed below
Sorting:
- Data processing system for polyglot☆92Updated 2 years ago
- Korean Multi-task Instruction Tuning☆158Updated last year
- ☆197Updated 2 years ago
- ☆107Updated 2 years ago
- Large-scale language modeling tutorials with PyTorch☆291Updated 3 years ago
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆75Updated last week
- IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델☆69Updated 2 years ago
- Korean Sentence Embedding Repository☆210Updated 9 months ago
- KoCLIP: Korean port of OpenAI CLIP, in Flax☆155Updated 2 years ago
- ☆123Updated 2 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Updated 2 years ago
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Updated last year
- Benchmark in Korean Context☆136Updated last year
- List of Korean pre-trained language models.☆187Updated 2 years ago
- 🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드☆58Updated 2 years ago
- Curation note of NLP datasets☆97Updated 2 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆244Updated 2 years ago
- Open Korean NLP Dataset Curation for the Users All Around the Globe☆152Updated last year
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆80Updated last year
- Korean Math Word Problems☆59Updated 3 years ago
- [Google Meet] MLLM Arxiv Casual Talk☆52Updated 2 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆309Updated 2 years ago
- bpe based korean t5 model for text-to-text unified framework☆63Updated last year
- Pecab: Pure python Korean morpheme analyzer based on Mecab☆171Updated last year
- ☆31Updated last year
- 한국어 언어모델 오픈소스☆82Updated 2 years ago
- ☆147Updated 3 years ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆434Updated 4 months ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆202Updated last year
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆100Updated 3 years ago