tunib-ai / large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
☆290Updated 3 years ago
Alternatives and similar repositories for large-scale-lm-tutorials:
Users that are interested in large-scale-lm-tutorials are comparing it to the libraries listed below
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆128Updated 2 years ago
- OSLO: Open Source framework for Large-scale model Optimization☆308Updated 2 years ago
- ☆186Updated 2 years ago
- KoCLIP: Korean port of OpenAI CLIP, in Flax☆150Updated last year
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆202Updated last year
- "A survey of Transformer" paper study 👩🏻💻🧑🏻💻 KoreaUniv. DSBA Lab☆188Updated 3 years ago
- My collection of machine learning papers☆279Updated last year
- [Google Meet] MLLM Arxiv Casual Talk☆52Updated 2 years ago
- List of Korean pre-trained language models.☆188Updated last year
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆242Updated last year
- A performance library for machine learning applications.☆183Updated last year
- Jiphyeonjeon Season 2☆121Updated 2 years ago
- ☆83Updated 11 months ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆399Updated last month
- Data processing system for polyglot☆91Updated last year
- [2021 훈민정음 한국어 음성•자연어 인공지능 경진대회] 대화요약 부문 알라꿍달라꿍 팀의 대화요약 학습 및 추론 코드를 공유하기 위한 레포입니다.☆128Updated 2 years ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆76Updated last year
- Curation note of NLP datasets☆96Updated 2 years ago
- ☆196Updated last year
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Updated 5 months ago
- KLUE 데이터를 활용한 HuggingFace Transformers 튜토리얼☆129Updated 3 years ago
- Korean-English Bilingual Electra Models☆109Updated 3 years ago
- Baseline code for Korean open domain question answering(ODQA)☆77Updated last year
- IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델☆68Updated last year
- ChatGPT의 RLHF를 학습을 위한 3가지 step별 한국어 데이터셋☆31Updated last year
- Review papers of NLP, mainly LLM.☆28Updated 11 months ago
- A clean and structured implementation of Transformer with wandb and pytorch-lightning☆71Updated 2 years ago
- Benchmark in Korean Context☆129Updated last year
- Liner LLM Meetup archive☆71Updated 11 months ago
- Jiphyeonjeon Season 3☆39Updated 2 years ago