tunib-ai / large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
☆290Updated 3 years ago
Alternatives and similar repositories for large-scale-lm-tutorials:
Users that are interested in large-scale-lm-tutorials are comparing it to the libraries listed below
- OSLO: Open Source framework for Large-scale model Optimization☆306Updated 2 years ago
- KoCLIP: Korean port of OpenAI CLIP, in Flax☆148Updated last year
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆127Updated 2 years ago
- A performance library for machine learning applications.☆182Updated last year
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆202Updated last year
- ☆83Updated 9 months ago
- "A survey of Transformer" paper study 👩🏻💻🧑🏻💻 KoreaUniv. DSBA Lab☆186Updated 3 years ago
- [2021 훈민정음 한국어 음성•자연어 인공지능 경진대회] 대화요약 부문 알라꿍달라꿍 팀의 대화요약 학습 및 추론 코드를 공유하기 위한 레포입니다.☆128Updated 2 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆238Updated last year
- [Google Meet] MLLM Arxiv Casual Talk☆55Updated last year
- ☆195Updated last year
- List of Korean pre-trained language models.☆188Updated last year
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆479Updated last year
- Data processing system for polyglot☆92Updated last year
- Review papers of NLP, mainly LLM.☆29Updated 9 months ago
- ☆185Updated 2 years ago
- Curation note of NLP datasets☆95Updated 2 years ago
- ☆23Updated 4 months ago
- KLUE 데이터를 활용한 HuggingFace Transformers 튜토리얼☆129Updated 3 years ago
- Jiphyeonjeon Season 2☆121Updated 2 years ago
- My collection of machine learning papers☆275Updated last year
- OSLO: Open Source for Large-scale Optimization☆175Updated last year
- Korean Sentence Embedding Repository☆202Updated last month
- ☆56Updated 2 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆302Updated last year
- Liner LLM Meetup archive☆72Updated 9 months ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe…☆91Updated 2 months ago
- Benchmark in Korean Context☆124Updated last year
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆373Updated 2 months ago
- Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)☆46Updated 10 months ago