cimeister/typical-sampling

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

☆81

Alternatives and similar repositories for typical-sampling

Users that are interested in typical-sampling are comparing it to the libraries listed below.

hyunwoongko / megatron-11b
Megatron LM 11B on Huggingface Transformers
☆28 · Jul 11, 2021 · Updated 5 years ago
songys / 2021Langcon
☆11 · Oct 3, 2021 · Updated 4 years ago
tunib-ai / artwork_captions
Machine Generated Captions for Best Artworks
☆22 · Sep 21, 2022 · Updated 3 years ago
ModuNLP / hacking_transformers
☆11 · Aug 12, 2020 · Updated 5 years ago
JoungheeKim / kor-spacing
This is project for korean auto spacing
☆12 · Aug 3, 2020 · Updated 5 years ago
lucidrains / n-grammer-pytorch
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆81 · Dec 4, 2022 · Updated 3 years ago
facebookresearch / unlikelihood_training
Neural Text Generation with Unlikelihood Training
☆311 · Aug 31, 2021 · Updated 4 years ago
lassl / lassl
Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets
☆130 · Nov 12, 2022 · Updated 3 years ago
rycolab / uid-decoding
☆42 · Mar 8, 2021 · Updated 5 years ago
jason9693 / FROZEN
☆14 · May 3, 2022 · Updated 4 years ago
gmftbyGMFTBY / PONE
☆13 · Sep 20, 2020 · Updated 5 years ago
bagustris / ssl-ser
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆10 · Mar 15, 2023 · Updated 3 years ago
tunib-ai / transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
☆31 · Feb 5, 2022 · Updated 4 years ago
wuch15 / HiTransformer
ACL 2021: HiTransformer
☆13 · May 29, 2021 · Updated 5 years ago
martiansideofthemoon / rankgen
Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…
☆140 · Aug 2, 2023 · Updated 2 years ago
krishnap25 / mauve
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
☆315 · Jul 12, 2024 · Updated 2 years ago
hkjeon13 / noising-korean
Adds noise to Korean documents.
☆27 · Nov 9, 2022 · Updated 3 years ago
nng555 / ssmba
☆61 · Apr 19, 2022 · Updated 4 years ago
deepspeedai / deepspeed-gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆21 · Nov 28, 2022 · Updated 3 years ago
MrBananaHuman / KoGPT2ForParaphrasing
TEMP
☆34 · Apr 2, 2020 · Updated 6 years ago
nawnoes / pytorch-gpt-x
An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.
☆29 · Jan 12, 2026 · Updated 6 months ago
sooftware / luna-transformer
A PyTorch Implementation of the Luna: Linear Unified Nested Attention
☆41 · Jul 29, 2021 · Updated 4 years ago
studio-ousia / bpr
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175 · Jun 6, 2021 · Updated 5 years ago
phosseini / GisPy
GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/
☆13 · Jul 1, 2024 · Updated 2 years ago
triplet02 / KoNPron
Convert Numerical Representations to Korean Pronunciation
☆14 · Apr 20, 2020 · Updated 6 years ago
kakaobrain / autowu
Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)
☆39 · Dec 3, 2021 · Updated 4 years ago
jason9693 / ETA4LLMs
Calculating Expected Time for training LLM.
☆39 · Apr 17, 2023 · Updated 3 years ago
lucidrains / local-attention-flax
Local Attention - Flax module for Jax
☆22 · May 26, 2021 · Updated 5 years ago
yxuansu / SimCTG
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
☆478 · Mar 7, 2024 · Updated 2 years ago
monologg / ko_lm_dataformat
A utility for storing and reading files for Korean LM training 💾
☆35 · Jul 18, 2026 · Updated last week
KLUE-benchmark / KLUE-baseline
Finetuning Pipeline
☆89 · Feb 25, 2022 · Updated 4 years ago
sooftware / speech-paper-review
Review of papers I read
☆14 · Dec 11, 2020 · Updated 5 years ago
noowad93 / chosung-translator
Chosung (initial consonant) interpreter based on ko-BART
☆29 · Mar 31, 2021 · Updated 5 years ago
jungokasai / twist_decoding
☆30 · May 20, 2022 · Updated 4 years ago
simonjisu / pytorch_tutorials
some tutorials for blog: simonjisu.github.io
☆23 · Mar 25, 2021 · Updated 5 years ago
kakao / kanana-2
☆23 · Jun 30, 2026 · Updated 3 weeks ago
PlusLabNLP / PredictiveEngagement
Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems
☆16 · Jun 8, 2021 · Updated 5 years ago
mgalley / DSTC7-End-to-End-Conversation-Modeling
Grounded conversational dataset for end-to-end conversational AI (official DSTC7 data)
☆175 · Aug 20, 2024 · Updated last year
tanyuqian / ctc-gen-eval
EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation
☆97 · Mar 20, 2023 · Updated 3 years ago