As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
★48, Aug 2, 2021, updated 4 years ago
Alternatives and similar repositories for gpt2-recycle
Users who are interested in gpt2-recycle are comparing it to the repositories listed below.
- exBERT on Transformers 🤗 (★10, Jun 14, 2021, updated 4 years ago)
- Dong Ho Lee, Jung Hoon Lee, Yu Ri Kim, Hyung Jun Kim, Seung Myun Park, Yu Jun Yang, Woong Bi Shin (★15, Apr 16, 2020, updated 5 years ago)
- Korean Nested Named Entity Corpus (★20, May 13, 2023, updated 2 years ago)
- Korean Abstract Meaning Representation (AMR) Corpus (★10, Feb 27, 2022, updated 4 years ago)
- Korean lexical semantic analysis model (★22, Apr 4, 2022, updated 3 years ago)
- Namuwiki dataset segmented into sentences. Download it from Releases or through tfds-korean. (★19, Jun 16, 2021, updated 4 years ago)
- Korean large emotion labeled dataset (EmoNSMC) (★14, Mar 5, 2020, updated 5 years ago)
- Korean text sentiment analysis using Naver movie review data (★12, Aug 22, 2018, updated 7 years ago)
- #Paired Question (★24, Jun 16, 2020, updated 5 years ago)
- Bias, Hate classification with KoELECTRA (★27, Jun 12, 2023, updated 2 years ago)
- Convenient Text-to-Text Training for Transformers (★19, Dec 10, 2021, updated 4 years ago)
- Reference PyTorch code for named entity tagging (★87, Oct 18, 2024, updated last year)
- 2019 Korean Language Competition, Korean dependency parsing, Grand Prize (Minister of Culture, Sports and Tourism Award) (★15, Oct 26, 2022, updated 3 years ago)
- Generate BERT vocabularies and pretraining examples from Wikipedias (★17, May 11, 2020, updated 5 years ago)
- Model that changes chatbot responses according to language style and emotion (★33, Aug 17, 2021, updated 4 years ago)
- Code and pre-trained models for the paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding" (★18, Oct 25, 2022, updated 3 years ago)
- Analysis of data on doctoral dissertations in modern Korean literature (★23, Aug 17, 2024, updated last year)
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings (★58, Jan 18, 2023, updated 3 years ago)
- (no description) (★75, Jul 2, 2021, updated 4 years ago)
- Korean sentence spacing (deletion/insertion) model. Written so that you can train it yourself after preparing the data. (★57, Jul 11, 2022, updated 3 years ago)
- (no description) (★19, Apr 1, 2022, updated 3 years ago)
- Korean LegalQA using SentenceKoBART (★97, Mar 25, 2023, updated 2 years ago)
- (no description) (★11, Jul 12, 2021, updated 4 years ago)
- (no description) (★11, Aug 12, 2020, updated 5 years ago)
- (no description) (★10, Dec 17, 2020, updated 5 years ago)
- Korean coreference resolution (over entity candidates) (★10, Aug 12, 2020, updated 5 years ago)
- Code for 'Contrastive Multi-Document Question Generation' (★11, Oct 16, 2022, updated 3 years ago)
- Combining encoder-based language models (★11, Nov 11, 2021, updated 4 years ago)
- [11th ToBig's Conference] AM I OK? - Psychological diagnosis AI based on answers from medical specialists (★12, Jan 19, 2021, updated 5 years ago)
- Implementation of the ACL Findings paper "OutFlip: Generating Examples for Unknown Intent Detection with Natural Language Attack" (★10, May 24, 2021, updated 4 years ago)
- Applied Data Science training course (for updates and resources, read the ReadMe file below) (★15, Sep 9, 2023, updated 2 years ago)
- Reference PyTorch code for intent classification (★44, Oct 18, 2024, updated last year)
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation" (★40, Jan 2, 2019, updated 7 years ago)
- Korean Speech to English Translation Corpus (★45, Sep 3, 2021, updated 4 years ago)
- (no description) (★184, May 26, 2023, updated 2 years ago)
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. (★27, Apr 21, 2023, updated 2 years ago)
- (no description) (★62, Apr 19, 2022, updated 3 years ago)
- Tokenizer comparison experiments (★11, Jan 3, 2022, updated 4 years ago)
- (no description) (★14, Dec 9, 2021, updated 4 years ago)