☆184 · May 26, 2023 · Updated 2 years ago
Alternatives and similar repositories for longt5
Users interested in longt5 are comparing it to the libraries listed below.
- Long-context pretrained encoder-decoder models ☆96 · Oct 28, 2022 · Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 · Jul 28, 2022 · Updated 3 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆98 · Apr 26, 2023 · Updated 2 years ago
- A utility for storing and reading files for Korean LM training 💾 ☆35 · Oct 15, 2025 · Updated 4 months ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC) ☆25 · Apr 11, 2022 · Updated 3 years ago
- 🐦 Pretrained BigBird Model for Korean (up to 4096 tokens) ☆201 · Dec 28, 2023 · Updated 2 years ago
- Korean Named Entity Corpus ☆25 · May 12, 2023 · Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆137 · Aug 2, 2023 · Updated 2 years ago
- This repo is for Korean wiki table question answering datasets described in the paper of Korean-Specific Dataset for Table Question Answe… ☆91 · Oct 22, 2024 · Updated last year
- Finetuning Pipeline ☆89 · Feb 25, 2022 · Updated 4 years ago
- [2022.05.16 ~ 2022.06.10] 🌤️ Clear photos without fine dust 📷 - Boostcamp AI Tech 3rd cohort final project ☆14 · Jun 11, 2022 · Updated 3 years ago
- An example of fine-tuning kogpt with oslo. ☆23 · Aug 26, 2022 · Updated 3 years ago
- Convenient Text-to-Text Training for Transformers ☆19 · Dec 10, 2021 · Updated 4 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets ☆130 · Nov 12, 2022 · Updated 3 years ago
- This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models". ☆48 · Jun 7, 2022 · Updated 3 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System ☆445 · May 9, 2022 · Updated 3 years ago
- ☆14 · Dec 9, 2021 · Updated 4 years ago
- An original implementation of "MetaICL: Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi ☆273 · Apr 15, 2023 · Updated 2 years ago
- bpe based korean t5 model for text-to-text unified framework ☆63 · Apr 17, 2024 · Updated last year
- Korean Math Word Problems ☆59 · Jan 14, 2022 · Updated 4 years ago
- ☆32 · Oct 30, 2023 · Updated 2 years ago
- ☆367 · Apr 12, 2024 · Updated last year
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 · Aug 31, 2022 · Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021) ☆48 · Aug 2, 2021 · Updated 4 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation ☆11 · May 27, 2022 · Updated 3 years ago
- ☆14 · May 3, 2022 · Updated 3 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes ☆12 · Jun 24, 2021 · Updated 4 years ago
- OSLO: Open Source framework for Large-scale model Optimization ☆309 · Aug 25, 2022 · Updated 3 years ago
- KOLD: Korean Offensive Language Dataset ☆81 · Nov 13, 2022 · Updated 3 years ago
- ☆537 · Feb 13, 2024 · Updated 2 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval". ☆28 · Jun 19, 2021 · Updated 4 years ago
- KorPatBERT, a Korean AI language model specialized for the patent domain ☆67 · Jan 31, 2024 · Updated 2 years ago
- Dataset of Korean Threatening Conversations ☆72 · Nov 1, 2022 · Updated 3 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning" ☆456 · Sep 6, 2023 · Updated 2 years ago
- Tasks for describing differences between text distributions. ☆17 · Aug 9, 2024 · Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment ☆791 · Apr 24, 2023 · Updated 2 years ago
- The Multitask Long Document Benchmark ☆42 · Nov 2, 2022 · Updated 3 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning ☆41 · May 5, 2021 · Updated 4 years ago
- ☆92 · Sep 29, 2021 · Updated 4 years ago