Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder LM (eg. Flan-T5).
☆168Jun 20, 2025Updated 11 months ago
Alternatives and similar repositories for lmppl
Users that are interested in lmppl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆29Mar 20, 2024Updated 2 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- Official code repository for Correct-N-Contrast☆22Jul 18, 2022Updated 3 years ago
- Word acquisition in neural language models (TACL 2022).☆21Jan 30, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆23Mar 11, 2026Updated 3 months ago
- R library for accessing data from everypolitician.org☆20Apr 24, 2018Updated 8 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated 2 years ago
- Official implementation of BPA (CVPR 2022)☆13Jun 17, 2022Updated 4 years ago
- ☆14Feb 9, 2022Updated 4 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 3 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆94Jun 8, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A repository for experiments in quality-aware decoding☆18Jun 7, 2022Updated 4 years ago
- ☆24Nov 22, 2022Updated 3 years ago
- ☆13Apr 5, 2026Updated 2 months ago
- albumentations test☆11Jun 23, 2020Updated 5 years ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆81Feb 28, 2024Updated 2 years ago
- ☆21Mar 28, 2022Updated 4 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 5 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Pytorch Tutorial for M1 students. This repository include Encoder Deocder model and Classification model building code.☆12Jun 1, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Arabic Word-Embedding (Word2vec) model training from Wikipedia articles☆11Dec 13, 2018Updated 7 years ago
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Jan 25, 2026Updated 4 months ago
- ☆43Oct 29, 2024Updated last year
- A accurate multilingual word aligner based on LaBSE☆24Oct 25, 2023Updated 2 years ago
- A powerful text cleaner for Japanese web texts☆12Jan 20, 2024Updated 2 years ago
- Aspect based sentiment analysis for Hindi☆11Aug 31, 2017Updated 8 years ago
- Convert English alphabet to Katakana☆15Feb 15, 2026Updated 4 months ago
- Entitypedia is an Extended Named Entity Dictionary from Wikipedia.☆13Dec 7, 2022Updated 3 years ago
- ☆20Apr 26, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 中文医学语料库☆15Jul 2, 2021Updated 4 years ago
- みんなが見たアニメ一覧をまとめて見れるやつ☆11Nov 19, 2025Updated 7 months ago
- CCL2022 领域问答库构建测评☆20Oct 31, 2022Updated 3 years ago
- ☆23Feb 26, 2024Updated 2 years ago
- Learning to Model Editing Processes☆26Aug 3, 2025Updated 10 months ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings☆97Jun 12, 2023Updated 3 years ago
- ☆351Aug 8, 2021Updated 4 years ago