EleutherAI / polyglotLinks
Polyglot: Large Language Models of Well-balanced Competence in Multi-languages
☆483Updated 2 years ago
Alternatives and similar repositories for polyglot
Users that are interested in polyglot are comparing it to the libraries listed below
Sorting:
- Data processing system for polyglot☆92Updated 2 years ago
- Large-scale language modeling tutorials with PyTorch☆290Updated 4 years ago
- Korean Multi-task Instruction Tuning☆158Updated last year
- ☆107Updated 2 years ago
- ☆197Updated 2 years ago
- Korean Sentence Embedding Repository☆210Updated 11 months ago
- [KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model☆75Updated 2 months ago
- Benchmark in Korean Context☆137Updated 2 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆131Updated 2 years ago
- IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델☆69Updated 2 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆247Updated 2 years ago
- KoCLIP: Korean port of OpenAI CLIP, in Flax☆154Updated 2 years ago
- List of Korean pre-trained language models.☆188Updated 2 years ago
- Open Korean NLP Dataset Curation for the Users All Around the Globe☆152Updated last year
- ☆123Updated 2 years ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆80Updated last year
- ChatGPT의 RLHF를 학습을 위한 3가지 step별 한국어 데이터셋☆39Updated last year
- Curation note of NLP datasets☆99Updated 2 years ago
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆101Updated 3 years ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆202Updated last year
- [Google Meet] MLLM Arxiv Casual Talk☆52Updated 2 years ago
- 🤗 최소한의 세 팅으로 LM을 학습하기 위한 샘플코드☆58Updated 2 years ago
- ☆68Updated last year
- Korean Math Word Problems☆59Updated 3 years ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆438Updated 6 months ago
- Pecab: Pure python Korean morpheme analyzer based on Mecab☆171Updated last year
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆48Updated last year
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆309Updated 2 years ago
- Korean LegalQA using SentenceKoBART☆96Updated 2 years ago
- ☆147Updated 3 years ago