joshualoehr / ngram-language-model
Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
☆87 · Updated 7 years ago
Alternatives and similar repositories for ngram-language-model
Users who are interested in ngram-language-model are comparing it to the libraries listed below.
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization." ☆143 · Updated 3 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming. ☆48 · Updated 3 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper ☆57 · Updated last year
- Improved version of GECToR ☆62 · Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆123 · Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) ☆153 · Updated 5 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization ☆157 · Updated 3 years ago
- [EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning ☆136 · Updated 2 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021) ☆30 · Updated 3 years ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines … ☆153 · Updated 2 years ago
- ☆61 · Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction ☆120 · Updated 4 years ago
- cLang-8 is a dataset for grammatical error correction. ☆112 · Updated 3 years ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings" ☆296 · Updated 3 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm. ☆53 · Updated 4 years ago
- mSimCSE: Multilingual SimCSE ☆33 · Updated 3 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models ☆24 · Updated 4 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir… ☆61 · Updated last year
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework ☆253 · Updated 2 years ago
- Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring ☆69 · Updated last year
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization" ☆142 · Updated 2 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining ☆118 · Updated 2 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits" ☆20 · Updated 3 years ago
- ☆43 · Updated 2 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021 ☆61 · Updated 4 years ago
- ☆254 · Updated last year
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization ☆80 · Updated 4 years ago
- Dialog State Tracking Challenge 2 & 3 Data ☆87 · Updated 3 years ago
- Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022) ☆51 · Updated 2 years ago
- Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022) ☆162 · Updated 2 years ago