Train your own GPT2!
☆14Apr 11, 2023Updated 2 years ago
Alternatives and similar repositories for KoGPT2-train
Users that are interested in KoGPT2-train are comparing it to the libraries listed below
- ☆10Aug 24, 2022Updated 3 years ago
- ☆15May 20, 2023Updated 2 years ago
- ☆23Oct 30, 2023Updated 2 years ago
- Korean-OpenOrca: llama2 fine-tuned on the OpenOrca-KO dataset☆18Nov 1, 2023Updated 2 years ago
- KoBART chatbot☆45Jun 22, 2021Updated 4 years ago
- An RLHF learning environment for Korean users☆25Sep 25, 2023Updated 2 years ago
- ☆32Nov 16, 2021Updated 4 years ago
- Automated Story-Telling using Event Representations (ASTER) from the AAAI 2018 paper "Event Representations for Automated Story Generatio…☆25Mar 3, 2022Updated 4 years ago
- A repo that implements in code Google's "Chain-of-Thought Reasoning without Prompting".☆65Sep 28, 2024Updated last year
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆38Jan 17, 2024Updated 2 years ago
- ☆197May 22, 2023Updated 2 years ago
- Distilling Task-Specific Knowledge from Teacher Model into BiLSTM☆32Dec 14, 2024Updated last year
- T5-base model for Korean☆27May 20, 2021Updated 4 years ago
- ☆34Jul 25, 2024Updated last year
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆34Dec 16, 2021Updated 4 years ago
- python library☆12Nov 25, 2025Updated 3 months ago
- Explains the conclusions of a logic program.☆10May 25, 2023Updated 2 years ago
- Official Code for Towards Transparent and Explainable Attention Models paper (ACL 2020)☆35Jun 22, 2022Updated 3 years ago
- ☆12Nov 30, 2022Updated 3 years ago
- A hackable library for running and fine-tuning modern transformer models on commodity and alternative GPUs, powered by tinygrad.☆28Feb 10, 2026Updated 3 weeks ago
- ☆12Feb 9, 2022Updated 4 years ago
- Docker Bind 1.9 image with Webmin Interface☆11Oct 29, 2020Updated 5 years ago
- A synthetic training data generator for a text recognition CNN☆10Jul 8, 2019Updated 6 years ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated last month
- Demonstration of the NEON approach for explainable clustering.☆11Mar 17, 2022Updated 3 years ago
- A Pytorch-Lightning Implementation of Transformer Network☆11Oct 22, 2020Updated 5 years ago
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- ☆11Mar 10, 2023Updated 2 years ago
- Implementation of Frank Farris's recipes for producing wallpaper patterns☆11Mar 26, 2024Updated last year
- ☆11Nov 14, 2021Updated 4 years ago
- A high level pool for maintaining pools of *sql.DB databases (e.g: thousands of SQLite files)☆10Oct 29, 2016Updated 9 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- Comment toxicity classification using Keras/TensorFlow☆10May 25, 2018Updated 7 years ago
- Playground project acting as an example for a complex LangChain workflow☆11Jun 20, 2023Updated 2 years ago
- This is a fork of code.google.com/p/snappy-go.☆11Mar 8, 2015Updated 10 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- Listen to a Redis PubSub channel and then rebroadcast over WebSockets.☆12Jun 23, 2016Updated 9 years ago
- Experiments with reasoning models, training techniques, papers☆25Updated this week