Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).
☆61Feb 7, 2022Updated 4 years ago
Alternatives and similar repositories for mirostat
Users that are interested in mirostat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.☆14Nov 17, 2023Updated 2 years ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Jun 27, 2023Updated 3 years ago
- code for training and using chess embeddings models☆14Jun 9, 2024Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Packaged version of Masonry for MeteorJS☆10Jan 25, 2016Updated 10 years ago
- Code accompanying our papers on the "Generative Distributional Control" framework☆117Dec 7, 2022Updated 3 years ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆16Nov 1, 2021Updated 4 years ago
- Replication package for evaluation of code generation metrics☆17Nov 24, 2025Updated 7 months ago
- ☆14Apr 22, 2024Updated 2 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆209Jun 16, 2025Updated last year
- Code for our EMNLP 2019 paper titled "Sentence-Level Content Planning and Style Specification for Neural Text Generation"☆17May 4, 2020Updated 6 years ago
- PyTorch reimplementation of REALM and ORQA☆22Feb 3, 2022Updated 4 years ago
- Implementation of "Mutimodal Convolution Neural Networks for Matching Image and Sentence"☆12Oct 25, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Bert2Bert model which able to generate headlines!☆12Nov 16, 2020Updated 5 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Mar 17, 2020Updated 6 years ago
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 4 years ago
- ☆14Feb 24, 2021Updated 5 years ago
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆13Nov 14, 2022Updated 3 years ago
- Official code for SongEcho☆64Mar 3, 2026Updated 4 months ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- ☆15May 14, 2019Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Posterior Control of Blackbox Generation☆23May 2, 2020Updated 6 years ago
- Scikit-learn quickstart tutorial for Webstep☆19May 4, 2017Updated 9 years ago
- Experiment Manager☆22Jan 15, 2020Updated 6 years ago
- f-PO: Generalizing Preference Optimization with f-divergence Minimization☆14Apr 2, 2025Updated last year
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆33Jul 20, 2022Updated 3 years ago
- Hawkes Point Processes in Python☆17Sep 7, 2013Updated 12 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Hosting telegram bot with Yandex.Cloud Functions☆14Jul 23, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A custom theme for crater.io☆16Dec 28, 2015Updated 10 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- ☆13Dec 12, 2025Updated 6 months ago
- A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch☆237Jun 12, 2023Updated 3 years ago
- [ICLR 2021] Group Equivariant Generative Adversarial Networks.☆14May 6, 2021Updated 5 years ago
- Pytorch implementation of The ICML 2020 paper "On Learning Sets of Symmetric Elements" by Haggai Maron, Or Litany, Gal Chechik, Ethan Fet…☆10Apr 22, 2021Updated 5 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated 2 years ago