joshualoehr / ngram-language-modelLinks
Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
☆87Updated 7 years ago
Alternatives and similar repositories for ngram-language-model
Users that are interested in ngram-language-model are comparing it to the libraries listed below
Sorting:
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆47Updated 2 years ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆141Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆121Updated 2 years ago
- cLang-8 is a dataset for grammatical error correction.☆109Updated 3 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆56Updated last year
- ☆61Updated 2 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 4 years ago
- Improved version of GECToR☆60Updated 2 years ago
- [EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning☆136Updated last year
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆52Updated 4 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆117Updated 4 months ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Updated 3 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆56Updated last year
- 科大讯飞低资源多语种文本翻译挑战赛获奖方案☆29Updated 2 years ago
- ☆14Updated 3 years ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆296Updated 2 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆255Updated last year
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆53Updated 3 years ago
- Dialog State Tracking Challenge 2 & 3 Data☆87Updated 3 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆61Updated 4 years ago
- ☆187Updated last year
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Updated 3 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"☆19Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated 2 years ago
- AMI and ICSI Corpora in JSON format.☆33Updated 2 years ago
- A lightweight implementation of Beam Search for sequence models in PyTorch.☆58Updated last year
- ☆99Updated 3 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆48Updated 3 years ago
- Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring☆65Updated last year