asahi417 / lmppl
Calculates the perplexity of a text with pre-trained language models. Supports masked LMs (e.g. DeBERTa), causal (autoregressive) LMs (e.g. GPT-3), and encoder-decoder LMs (e.g. Flan-T5).
☆123 Updated last month
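As a rough illustration of the quantity described above: perplexity is conventionally the exponential of the average negative log-likelihood per token. The sketch below is not lmppl's API. The function name `perplexity` and the sample log-probabilities are hypothetical, and a real scorer would take the per-token log-probabilities from a pre-trained model.

```python
import math


def perplexity(token_log_probs):
    """Return exp(mean negative log-likelihood) for natural-log token probabilities."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    # Average negative log-likelihood per token.
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_nll)


# Hypothetical values: a model that gives every token probability 0.25
# is as uncertain as a uniform choice among four options.
uniform_log_probs = [math.log(0.25)] * 4
# Hypothetical values: a more confident model yields a lower perplexity.
confident_log_probs = [math.log(0.9), math.log(0.8), math.log(0.95)]

print(perplexity(uniform_log_probs))
print(perplexity(confident_log_probs))
```

Lower values mean the model finds the text more predictable, which is why perplexity is often used to rank candidate texts.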
Related projects:
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning" ☆157 Updated 3 years ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation ☆187 Updated 7 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆96 Updated 5 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ☆174 Updated last week
- A Multilingual Replicable Instruction-Following Model ☆91 Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions. ☆177 Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire" ☆219 Updated last year
- contrastive decoding ☆174 Updated last year
- A multilingual version of MS MARCO passage ranking dataset ☆142 Updated 11 months ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an… ☆269 Updated last year
- Multilingual Large Language Models Evaluation Benchmark ☆91 Updated last month
- Scalable training for dense retrieval models. ☆268 Updated last year
- Codebase, data and models for the SummaC paper in TACL ☆80 Updated 9 months ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper ☆63 Updated 3 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆159 Updated 11 months ago
- Finetune mistral-7b-instruct for sentence embeddings ☆65 Updated 4 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆98 Updated last year
- Train Dense Passage Retriever (DPR) with a single GPU ☆128 Updated 3 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆91 Updated last year
- An original implementation of "MetaICL: Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi ☆249 Updated last year
- Code for Editing Factual Knowledge in Language Models ☆134 Updated 2 years ago
- Zero-shot Document Ranking with Large Language Models. ☆88 Updated 2 months ago
- Token-level Reference-free Hallucination Detection ☆92 Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation. ☆104 Updated 6 months ago
- BARTScore: Evaluating Generated Text as Text Generation ☆315 Updated 2 years ago
- A Survey of Attributions for Large Language Models ☆155 Updated 3 weeks ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization." ☆137 Updated last year