FerreroJeremy / Cross-Language-Dataset
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
☆60Updated 8 years ago
Alternatives and similar repositories for Cross-Language-Dataset
Users who are interested in Cross-Language-Dataset are comparing it to the repositories listed below
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- Data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015☆43Updated 4 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 6 years ago
- ROUGE summarization evaluation metric, enhanced with use of Word Embeddings☆23Updated 6 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 6 years ago
- ☆33Updated 3 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning☆60Updated 2 years ago
- A curated question answering research dataset of factoid questions☆49Updated 5 years ago
- COLING 2018 Tutorial on Multilingual FrameNet: Automatic semantic role labeling for FrameNet☆25Updated 6 years ago
- The WebSplit Benchmark introducing "Split and Rephrase" task☆63Updated 6 years ago
- Preprocessing scripts to read definitions and other information from dictionaries☆22Updated 7 years ago
- Keras implementation of ontology aware token embeddings☆48Updated 6 years ago
- ☆38Updated 8 years ago
- Text Simplification Model based on Encoder-Decoder (includes Transformer and Seq2Seq) model.☆68Updated 2 years ago
- Named Entity Disambiguation for Noisy Text☆66Updated 8 years ago
- Dict2vec is a framework to learn word embeddings using lexical dictionaries.☆114Updated 4 years ago
- Pre-training character n-gram embeddings☆22Updated last year
- Sume is an implementation of the concept-based ILP model for summarization.☆37Updated 6 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Dependencies☆70Updated 6 years ago
- This is the reference implementation of commonly used coreference metrics.☆74Updated 7 years ago
- ☆24Updated 8 years ago
- ☆44Updated 7 years ago
- Convolutional network for entity linking (Naacl 2016)☆57Updated 8 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings☆99Updated 5 years ago
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- takahe is a multi-sentence compression module☆53Updated 4 years ago
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task.☆47Updated 6 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- Getting started with AllenNLP and PyTorch by training a tweet classifier☆66Updated 7 years ago