joshualoehr / ngram-language-modelLinks
Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
☆88Updated 7 years ago
Alternatives and similar repositories for ngram-language-model
Users that are interested in ngram-language-model are comparing it to the libraries listed below
Sorting:
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆143Updated 3 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆47Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆121Updated 2 years ago
- ☆61Updated 2 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆255Updated last year
- [EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning☆136Updated 2 years ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …☆152Updated 2 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆56Updated last year
- Improved version of GECToR☆61Updated 2 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 4 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated 3 years ago
- ☆14Updated 4 years ago
- Dialog State Tracking Challenge 2 & 3 Data☆86Updated 3 years ago
- Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring☆68Updated last year
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆48Updated 3 years ago
- cLang-8 is a dataset for grammatical error correction.☆110Updated 3 years ago
- mSimCSE: Multilingual SimCSE☆34Updated 3 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆58Updated last year
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆184Updated last year
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆296Updated 3 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Updated 3 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆96Updated 9 months ago
- Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022)☆51Updated 2 years ago
- ☆33Updated 2 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆119Updated 2 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆53Updated 4 years ago
- ☆43Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago