joshualoehr / ngram-language-modelLinks
Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
☆88Updated 7 years ago
Alternatives and similar repositories for ngram-language-model
Users that are interested in ngram-language-model are comparing it to the libraries listed below
Sorting:
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆142Updated 3 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆47Updated 2 years ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …☆152Updated 2 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆56Updated last year
- ☆61Updated 2 years ago
- mSimCSE: Multilingual SimCSE☆34Updated 3 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆255Updated last year
- Improved version of GECToR☆60Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆121Updated 2 years ago
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆53Updated 3 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆219Updated 4 years ago
- Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022)☆51Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆296Updated 3 years ago
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆183Updated 11 months ago
- The official repository for paper "It is AI’s Turn to Ask Humans a Question: Question-Answer Pair Generation for Children’s Story Books" …☆32Updated 2 years ago
- cLang-8 is a dataset for grammatical error correction.☆110Updated 3 years ago
- ☆250Updated last year
- [EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning☆136Updated 2 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 3 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"☆19Updated 2 years ago
- back translation for NLP☆26Updated 4 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 4 years ago
- ☆33Updated 2 years ago
- Dialog State Tracking Challenge 2 & 3 Data☆86Updated 3 years ago
- Transitioning from Open-Domain Chit-Chat to Task-Oriented Dialogues☆43Updated 3 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆96Updated 8 months ago