π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
β81Mar 17, 2022Updated 4 years ago
Alternatives and similar repositories for typical-sampling
Users that are interested in typical-sampling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Megatron LM 11B on Huggingface Transformersβ27Jul 11, 2021Updated 4 years ago
- β11Oct 3, 2021Updated 4 years ago
- β11Aug 12, 2020Updated 5 years ago
- Machine Generated Captions for Best Artworksβ22Sep 21, 2022Updated 3 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorchβ76Dec 4, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This is project for korean auto spacingβ12Aug 3, 2020Updated 5 years ago
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain β¦β19Dec 16, 2022Updated 3 years ago
- Neural Text Generation with Unlikelihood Trainingβ311Aug 31, 2021Updated 4 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasetsβ130Nov 12, 2022Updated 3 years ago
- β41Mar 8, 2021Updated 5 years ago
- β14May 3, 2022Updated 4 years ago
- β13Sep 20, 2020Updated 5 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"β10Mar 15, 2023Updated 3 years ago
- π Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeedβ31Feb 5, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ACL 2021: HiTransformerβ13May 29, 2021Updated 4 years ago
- νκ΅μ΄ λ¬Έμμ λ Έμ΄μ¦λ₯Ό μΆκ°ν©λλ€.β27Nov 9, 2022Updated 3 years ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.β309Jul 12, 2024Updated last year
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxβ¦β139Aug 2, 2023Updated 2 years ago
- β62Apr 19, 2022Updated 4 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.β21Nov 28, 2022Updated 3 years ago
- TEMPβ34Apr 2, 2020Updated 6 years ago
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.β29Jan 12, 2026Updated 3 months ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attentionβ41Jul 29, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Calculating Expected Time for training LLM.β39Apr 17, 2023Updated 3 years ago
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)β39Dec 3, 2021Updated 4 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answeringβ174Jun 6, 2021Updated 4 years ago
- Convert Numerical Representations to Korean Pronunciationβ14Apr 20, 2020Updated 6 years ago
- Local Attention - Flax module for Jaxβ22May 26, 2021Updated 4 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/β13Jul 1, 2024Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generationβ476Mar 7, 2024Updated 2 years ago
- A utility for storing and reading files for Korean LM training πΎβ35Oct 15, 2025Updated 6 months ago
- Review of papers I readβ14Dec 11, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- μ΄μ± ν΄μκΈ° based on ko-BARTβ29Mar 31, 2021Updated 5 years ago
- β30May 20, 2022Updated 3 years ago
- Finetuning Pipelineβ89Feb 25, 2022Updated 4 years ago
- Grounded conversational dataset for end-to-end conversational AI (official DSTC7 data)β175Aug 20, 2024Updated last year
- Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systemsβ16Jun 8, 2021Updated 4 years ago
- MeCab model trained with OpenKorPos.β23Jun 19, 2022Updated 3 years ago
- some tutorials for blog: simonjisu.github.ioβ23Mar 25, 2021Updated 5 years ago