intfloat / uts
python package for unsupervised text segmentation.
☆14Updated 8 years ago
Alternatives and similar repositories for uts:
Users that are interested in uts are comparing it to the libraries listed below
- Pre-training character n-gram embeddings☆22Updated last year
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31Updated 4 years ago
- Modularizing Unsupervised Sense Embedding☆29Updated 7 years ago
- Converter from UD-trees to BART representation☆36Updated 11 months ago
- A collection of English tweets annotated in Universal Dependencies.☆39Updated 3 years ago
- Training scripts and recipes for Sockeye Neural Machine Translation toolkit☆37Updated 5 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- ☆33Updated 3 years ago
- Preprocessing scripts to read definitions and other information from dictionaries☆22Updated 7 years ago
- ☆27Updated 8 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"☆42Updated 3 years ago
- Assessing syntactic abilities of BERT☆39Updated 5 years ago
- A python module to process data for Frame Semantic Parsing☆23Updated 4 years ago
- Text generation with entities as context☆30Updated 6 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 5 years ago
- EMNLP DiscoEval paper☆42Updated 5 years ago
- A web interface to understand language-specific BERT-models☆17Updated 10 months ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- ☆29Updated last year
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- ☆24Updated 5 years ago
- Code for ICLR 2019 paper 'CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model'☆21Updated 5 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.☆42Updated 5 years ago
- Diverse Natural Language Inference Collection - NLI dataset that can used to evaluate how well models perform distinct types of reasoning…☆36Updated 4 years ago
- The dataset and statistical analysis code released with the submission of EMNLP 2017 paper "Why We Need New Evaluation Metrics for NLG"☆19Updated 3 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆62Updated 2 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings☆95Updated last year
- ☆44Updated 7 years ago