🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
☆81 · Mar 17, 2022 · Updated 4 years ago
Alternatives and similar repositories for typical-sampling
Users that are interested in typical-sampling are comparing it to the libraries listed below.
- Megatron LM 11B on Huggingface Transformers ☆27 · Jul 11, 2021 · Updated 4 years ago
- ☆11 · Oct 3, 2021 · Updated 4 years ago
- ☆11 · Aug 12, 2020 · Updated 5 years ago
- Machine Generated Captions for Best Artworks ☆22 · Sep 21, 2022 · Updated 3 years ago
- This is project for korean auto spacing ☆12 · Aug 3, 2020 · Updated 5 years ago
- Neural Text Generation with Unlikelihood Training ☆310 · Aug 31, 2021 · Updated 4 years ago
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain … ☆19 · Dec 16, 2022 · Updated 3 years ago
- ☆14 · May 3, 2022 · Updated 3 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets ☆130 · Nov 12, 2022 · Updated 3 years ago
- ☆41 · Mar 8, 2021 · Updated 5 years ago
- ☆13 · Sep 20, 2020 · Updated 5 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ☆10 · Mar 15, 2023 · Updated 3 years ago
- Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed ☆31 · Feb 5, 2022 · Updated 4 years ago
- ACL 2021: HiTransformer ☆13 · May 29, 2021 · Updated 4 years ago
- Adds noise to Korean documents. ☆27 · Nov 9, 2022 · Updated 3 years ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`. ☆309 · Jul 12, 2024 · Updated last year
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆137 · Aug 2, 2023 · Updated 2 years ago
- ☆62 · Apr 19, 2022 · Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. ☆21 · Nov 28, 2022 · Updated 3 years ago
- TEMP ☆34 · Apr 2, 2020 · Updated 5 years ago
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism. ☆30 · Jan 12, 2026 · Updated 2 months ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention ☆41 · Jul 29, 2021 · Updated 4 years ago
- Calculating Expected Time for training LLM. ☆38 · Apr 17, 2023 · Updated 2 years ago
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML) ☆39 · Dec 3, 2021 · Updated 4 years ago
- Convert Numerical Representations to Korean Pronunciation ☆14 · Apr 20, 2020 · Updated 5 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆175 · Jun 6, 2021 · Updated 4 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/ ☆13 · Jul 1, 2024 · Updated last year
- A utility for storing and reading files for Korean LM training ☆35 · Oct 15, 2025 · Updated 5 months ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation ☆475 · Mar 7, 2024 · Updated 2 years ago
- Review of papers I read ☆14 · Dec 11, 2020 · Updated 5 years ago
- Choseong (Korean initial consonant) interpreter based on ko-BART ☆29 · Mar 31, 2021 · Updated 4 years ago
- ☆30 · May 20, 2022 · Updated 3 years ago
- Finetuning Pipeline ☆89 · Feb 25, 2022 · Updated 4 years ago
- Grounded conversational dataset for end-to-end conversational AI (official DSTC7 data) ☆175 · Aug 20, 2024 · Updated last year
- MeCab model trained with OpenKorPos. ☆23 · Jun 19, 2022 · Updated 3 years ago
- Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems ☆16 · Jun 8, 2021 · Updated 4 years ago
- some tutorials for blog: simonjisu.github.io ☆23 · Mar 25, 2021 · Updated 4 years ago
- ☆13 · Jul 20, 2023 · Updated 2 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation ☆97 · Mar 20, 2023 · Updated 3 years ago