π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
β81Mar 17, 2022Updated 3 years ago
Alternatives and similar repositories for typical-sampling
Users that are interested in typical-sampling are comparing it to the libraries listed below
Sorting:
- Megatron LM 11B on Huggingface Transformersβ27Jul 11, 2021Updated 4 years ago
- β11Aug 12, 2020Updated 5 years ago
- This is project for korean auto spacingβ12Aug 3, 2020Updated 5 years ago
- β11Oct 3, 2021Updated 4 years ago
- Review of papers I readβ14Dec 11, 2020Updated 5 years ago
- Machine Generated Captions for Best Artworksβ22Sep 21, 2022Updated 3 years ago
- β14May 3, 2022Updated 3 years ago
- β41Mar 8, 2021Updated 4 years ago
- Neural Text Generation with Unlikelihood Trainingβ310Aug 31, 2021Updated 4 years ago
- Convert Numerical Representations to Korean Pronunciationβ14Apr 20, 2020Updated 5 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasetsβ130Nov 12, 2022Updated 3 years ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attentionβ41Jul 29, 2021Updated 4 years ago
- νκ΅μ΄ λ¬Έμμ λ Έμ΄μ¦λ₯Ό μΆκ°ν©λλ€.β27Nov 9, 2022Updated 3 years ago
- π Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeedβ31Feb 5, 2022Updated 4 years ago
- β62Apr 19, 2022Updated 3 years ago
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.β30Jan 12, 2026Updated last month
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxβ¦β137Aug 2, 2023Updated 2 years ago
- β17Dec 28, 2023Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.β21Nov 28, 2022Updated 3 years ago
- μ΄μ± ν΄μκΈ° based on ko-BARTβ29Mar 31, 2021Updated 4 years ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.β308Jul 12, 2024Updated last year
- Calculating Expected Time for training LLM.β38Apr 17, 2023Updated 2 years ago
- Grounded conversational dataset for end-to-end conversational AI (official DSTC7 data)β175Aug 20, 2024Updated last year
- β54Jan 18, 2023Updated 3 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.β21Jan 10, 2022Updated 4 years ago
- TEMPβ34Apr 2, 2020Updated 5 years ago
- A utility for storing and reading files for Korean LM training πΎβ35Oct 15, 2025Updated 4 months ago
- Korean Visual Question Answeringβ59Feb 18, 2020Updated 6 years ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.β19Jan 21, 2021Updated 5 years ago
- MeCab model trained with OpenKorPos.β23Jun 19, 2022Updated 3 years ago
- β14Feb 2, 2025Updated last year
- β13Jul 20, 2023Updated 2 years ago
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)β40Dec 3, 2021Updated 4 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)β119Oct 8, 2020Updated 5 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generationβ475Mar 7, 2024Updated last year
- β41Feb 12, 2019Updated 7 years ago
- Tiny configuration for Triton Inference Serverβ45Jan 10, 2025Updated last year
- Anh - LAION's multilingual assistant datasets and modelsβ27Apr 5, 2023Updated 2 years ago
- μΈμ΄λͺ¨λΈμ νμ΅νκΈ° μν κ³΅κ° νκ΅μ΄ instruction datasetλ€μ λͺ¨μλμμ΅λλ€.β19Jul 16, 2023Updated 2 years ago