FerreroJeremy / Cross-Language-Dataset
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
☆60 Updated 7 years ago
Related projects
Alternatives and complementary repositories for Cross-Language-Dataset
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation ☆63 Updated 6 years ago
- Large-scale sentential paraphrases collection and annotation ☆47 Updated last year
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora. ☆33 Updated 6 years ago
- Sume is an implementation of the concept-based ILP model for summarization. ☆38 Updated 6 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies. ☆28 Updated 5 years ago
- Multilingual hierarchical attention networks toolkit ☆78 Updated 4 years ago
- Dataset for the Emerging & Novel Entity NER task (WNUT '17) ☆111 Updated 2 years ago
- Data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015 ☆44 Updated 4 years ago
- Implementation of "Overcoming the Lack of Parallel Data in Sentence Compression" (Katja Filippova and Yasemin Altun, Google) ☆14 Updated 8 years ago
- Text Simplification System and Dataset ☆123 Updated last year
- Abstractive Summarization IJCAI paper code ☆29 Updated 6 years ago
- The WebSplit Benchmark introducing "Split and Rephrase" task ☆64 Updated 6 years ago
- takahe is a multi-sentence compression module ☆54 Updated 3 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning ☆60 Updated last year
- Named Entity Disambiguation for Noisy Text ☆67 Updated 7 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings) ☆123 Updated last year
- COLING 2018 Tutorial on Multilingual FrameNet: Automatic semantic role labeling for FrameNet ☆25 Updated 6 years ago
- Implementation of "Controlling Output Length in Neural Encoder-Decoders" ☆42 Updated 6 years ago
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task. ☆47 Updated 6 years ago
- Datasets for Question Answering by Search and Reading ☆70 Updated 6 years ago
- SideNet: Neural Extractive Summarization with Side Information ☆57 Updated 4 years ago
- Keras implementation of CoVe ☆51 Updated 6 years ago
- NLP research experiments, built on PyTorch within the AllenNLP framework. ☆91 Updated 8 months ago
- A transition-based parser for Universal Dependencies with BiLSTM word and character representations. ☆80 Updated 2 years ago