A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
☆61May 29, 2017Updated 8 years ago
Alternatives and similar repositories for Cross-Language-Dataset
Users that are interested in Cross-Language-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Multilingual and Multilevel Representation Learning Toolkit for NLP☆117Feb 14, 2018Updated 8 years ago
- Open-source implementation of the BilBOWA (Bilingual Bag-of-Words without Alignments) word embedding model.☆69Jul 28, 2021Updated 4 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- Train bilingual embeddings as described in our NAACL 2015 workshop paper "Bilingual Word Representations with Monolingual Quality in Mind…☆79Jun 15, 2019Updated 6 years ago
- Rust library for indexing and quickly searching large pretraining corpora☆31Oct 30, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Atari gauntlet for RL agents☆29Mar 18, 2017Updated 9 years ago
- Crosslingual word embeddings described in our EMNLP paper☆16Sep 21, 2016Updated 9 years ago
- Code and data for EMNLP 2018 paper "Cross-lingual Lexical Sememe Prediction"☆19Nov 9, 2018Updated 7 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26May 4, 2017Updated 8 years ago
- Semantic Textual Similarity in Python☆80Jan 30, 2017Updated 9 years ago
- Implementation of a deep recursive net over binary parse trees (code for NIPS2014 paper)☆28Feb 6, 2015Updated 11 years ago
- A curated question answering research dataset of factoid questions☆49Nov 9, 2019Updated 6 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Mar 24, 2023Updated 3 years ago
- Score your NLP paper review☆24Jul 17, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Using embedding-based loss functions for phonetics/speech recognition.☆17Nov 24, 2014Updated 11 years ago
- ☆12Aug 9, 2016Updated 9 years ago
- A calculator with equations and variables☆13Mar 23, 2016Updated 10 years ago
- ☆15May 19, 2017Updated 8 years ago
- A Spark based semantic reasoning engine☆14Mar 28, 2017Updated 9 years ago
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser☆11Oct 14, 2016Updated 9 years ago
- The pytorch implementation of paper "DeepChannel: Salience Estimation by Contrastive Learning for Extractive Document Summarization"☆25Feb 14, 2019Updated 7 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆32May 23, 2022Updated 3 years ago
- This is an implementation of the Attention Sum Reader model as presented in "Text Comprehension with the Attention Sum Reader Network" av…☆98Sep 9, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Sep 6, 2024Updated last year
- Structured Neural Networks for NLP: From Idea to Code☆59Dec 13, 2016Updated 9 years ago
- Scraps of random machine learning code☆15Oct 19, 2016Updated 9 years ago
- Reimplementation of Munkhdalai et al's Neural Semantic Encoders (https://arxiv.org/pdf/1607.04315v2.pdf)☆59Oct 28, 2016Updated 9 years ago
- Indonesian Resource Grammar (INDRA) - an implemented HPSG grammar for Indonesian☆15Mar 15, 2026Updated last month
- ☆12Nov 1, 2025Updated 6 months ago
- Just a data☆11Oct 20, 2025Updated 6 months ago
- Twitter sentiment analysis part 5: Tfidf vectorizer, model comparison, lexical approach☆12Feb 27, 2018Updated 8 years ago
- [ACL 2018] Conditional Generators of Words Definitions☆33Jul 18, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Attempts to create a state of the art language model on clinical and medical text data.☆12Oct 9, 2018Updated 7 years ago
- All My Pytorch projects reside here☆33Dec 10, 2017Updated 8 years ago
- ☆81Mar 8, 2014Updated 12 years ago
- A curated list of resources related to temporal embeddings☆14Dec 14, 2018Updated 7 years ago
- ☆144Dec 31, 2019Updated 6 years ago
- Benchmark Datasets for BioNLP Tasks☆17May 7, 2025Updated 11 months ago
- This package supports implementation of anchor-based topic modeling and variants of the anchoring algorithm in Python 3.☆15Sep 17, 2018Updated 7 years ago