Anh - LAION's multilingual assistant datasets and models
☆27Apr 5, 2023Updated 2 years ago
Alternatives and similar repositories for Anh
Users that are interested in Anh are comparing it to the libraries listed below
Sorting:
- Beyond LM: How can language model go forward in the future?☆15Apr 30, 2023Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Jan 13, 2024Updated 2 years ago
- ☆19Sep 20, 2022Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 5 months ago
- Calculating Expected Time for training LLM.☆38Apr 17, 2023Updated 2 years ago
- Experiments with generating opensource language model assistants☆97May 14, 2023Updated 2 years ago
- 한국어 어휘 의미 분석 모델☆22Apr 4, 2022Updated 3 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆19Jul 16, 2023Updated 2 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Sep 1, 2023Updated 2 years ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Machine Generated Captions for Best Artworks☆22Sep 21, 2022Updated 3 years ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60May 3, 2022Updated 3 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 4 years ago
- kogpt를 oslo로 파인튜닝하는 예제.☆23Aug 26, 2022Updated 3 years ago
- ☆11Aug 12, 2020Updated 5 years ago
- Implementation of stop sequencer for Huggingface Transformers☆16Jun 6, 2023Updated 2 years ago
- ☆13Feb 26, 2023Updated 3 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 4 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- Korean Moview Review Emotion (KMRE) Dataset☆21Sep 7, 2020Updated 5 years ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 4 years ago
- MeCab model trained with OpenKorPos.☆23Jun 19, 2022Updated 3 years ago
- ☆17Updated this week
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆87Feb 19, 2023Updated 3 years ago
- Personal information identification standard☆21Jan 24, 2024Updated 2 years ago
- OSLO: Open Source for Large-scale Optimization☆175Sep 9, 2023Updated 2 years ago
- NSMC, KorSTS ... fine-tunings☆18Feb 23, 2022Updated 4 years ago
- Data processing system for polyglot☆93Sep 5, 2023Updated 2 years ago
- huggingface를 이용하여 downstream task 수행하기☆62Dec 28, 2021Updated 4 years ago
- [제 11회 투빅스 컨퍼런스] AM I OK ? - 전문의 답변 기반 심리진단 AI☆12Jan 19, 2021Updated 5 years ago
- ☆106May 8, 2023Updated 2 years ago
- Bias, Hate classification with KoELECTRA 👿☆27Jun 12, 2023Updated 2 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- Parallel dataset of Korean Questions and Commands☆60Mar 24, 2023Updated 2 years ago
- #Paired Question☆24Jun 16, 2020Updated 5 years ago
- Python wrapper for KoalaNLP (Korean NLP with Java/Scala)☆31Jan 20, 2026Updated 2 months ago
- 초성 해석기 based on ko-BART☆29Mar 31, 2021Updated 4 years ago
- Korean large emotion labeled dataset (EmoNSMC)☆14Mar 5, 2020Updated 6 years ago