FerreroJeremy/Cross-Language-Dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FerreroJeremy/Cross-Language-Dataset)

FerreroJeremy / Cross-Language-Dataset

A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection

☆61

Alternatives and similar repositories for Cross-Language-Dataset

Users that are interested in Cross-Language-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alex-berard / multivec
View on GitHub
A Multilingual and Multilevel Representation Learning Toolkit for NLP
☆117Feb 14, 2018Updated 8 years ago
gouwsmeister / bilbowa
View on GitHub
Open-source implementation of the BilBOWA (Bilingual Bag-of-Words without Alignments) word embedding model.
☆69Jul 28, 2021Updated 4 years ago
nmrksic / eval-multilingual-simlex
View on GitHub
Tool for Evaluating Multilingual WS-353 and SimLex-999
☆10Dec 15, 2016Updated 9 years ago
lmthang / bivec
View on GitHub
Train bilingual embeddings as described in our NAACL 2015 workshop paper "Bilingual Word Representations with Monolingual Quality in Mind…
☆79Jun 15, 2019Updated 7 years ago
viking-sudo-rm / rusty-dawg
View on GitHub
Rust library for indexing and quickly searching large pretraining corpora
☆31Oct 30, 2025Updated 8 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
deontologician / atari_multitask
View on GitHub
Atari gauntlet for RL agents
☆29Mar 18, 2017Updated 9 years ago
artetxem / vecmap
View on GitHub
A framework to learn cross-lingual word embedding mappings
☆654Apr 22, 2023Updated 3 years ago
longdt219 / XlingualEmb
View on GitHub
Crosslingual word embeddings described in our EMNLP paper
☆16Sep 21, 2016Updated 9 years ago
thunlp / CLSP
View on GitHub
Code and data for EMNLP 2018 paper "Cross-lingual Lexical Sememe Prediction"
☆19Nov 9, 2018Updated 7 years ago
talolard / DenseContinuousSentances
View on GitHub
An aspiring attempt to generate a continuous space of sentences with DenseNet
☆26May 4, 2017Updated 9 years ago
alvations / stasis
View on GitHub
Semantic Textual Similarity in Python
☆80Jan 30, 2017Updated 9 years ago
oir / deep-recursive
View on GitHub
Implementation of a deep recursive net over binary parse trees (code for NIPS2014 paper)
☆28Feb 6, 2015Updated 11 years ago
brmson / dataset-factoid-curated
View on GitHub
A curated question answering research dataset of factoid questions
☆49Nov 9, 2019Updated 6 years ago
SushantKafle / speechtext-wimp-labeler
View on GitHub
This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…
☆11Mar 24, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
syhw / speech_embeddings
View on GitHub
Using embedding-based loss functions for phonetics/speech recognition.
☆17Nov 24, 2014Updated 11 years ago
Alibaba-NLP / MuVER
View on GitHub
[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations
☆32May 23, 2022Updated 4 years ago
rkadlec / asreader
View on GitHub
This is an implementation of the Attention Sum Reader model as presented in "Text Comprehension with the Attention Sum Reader Network" av…
☆98Sep 9, 2016Updated 9 years ago
salokr / Email-Event-Extraction
View on GitHub
☆10Sep 6, 2024Updated last year
rkargon / Scene-Labeling
View on GitHub
Experiments with an rCNN for scene labeling.
☆15Mar 20, 2019Updated 7 years ago
neubig / yrsnlp-2016
View on GitHub
Structured Neural Networks for NLP: From Idea to Code
☆59Dec 13, 2016Updated 9 years ago
bmcfee / ml_scraps
View on GitHub
Scraps of random machine learning code
☆15Oct 19, 2016Updated 9 years ago
pdasigi / neural-semantic-encoders
View on GitHub
Reimplementation of Munkhdalai et al's Neural Semantic Encoders (https://arxiv.org/pdf/1607.04315v2.pdf)
☆59Oct 28, 2016Updated 9 years ago
davidmoeljadi / INDRA
View on GitHub
Indonesian Resource Grammar (INDRA) - an implemented HPSG grammar for Indonesian
☆15Mar 15, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BardiaKh / RadPrompter
View on GitHub
☆12Nov 1, 2025Updated 8 months ago
lilt / tec
View on GitHub
Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022].
☆19Dec 9, 2022Updated 3 years ago
agadetsky / pytorch-definitions
View on GitHub
[ACL 2018] Conditional Generators of Words Definitions
☆33Jul 18, 2018Updated 7 years ago
caesarnine / mimic-iii-language-model
View on GitHub
Attempts to create a state of the art language model on clinical and medical text data.
☆12Oct 9, 2018Updated 7 years ago
svishnu88 / pytorch
View on GitHub
All My Pytorch projects reside here
☆33Dec 10, 2017Updated 8 years ago
TheShadow29 / infnet-spen
View on GitHub
TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"
☆30Jun 10, 2018Updated 8 years ago
manuyavuz / temporal-embeddings
View on GitHub
A curated list of resources related to temporal embeddings
☆15Dec 14, 2018Updated 7 years ago
ma-sultan / monolingual-word-aligner
View on GitHub
☆81Mar 8, 2014Updated 12 years ago
rajarshd / Gaussian_LDA
View on GitHub
☆143Dec 31, 2019Updated 6 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
BIDS-Xu-Lab / Biomedical-NLP-Benchmarks
View on GitHub
Benchmark Datasets for BioNLP Tasks
☆17May 7, 2025Updated last year
forest-snow / anchor-topic
View on GitHub
This package supports implementation of anchor-based topic modeling and variants of the anchoring algorithm in Python 3.
☆15Sep 17, 2018Updated 7 years ago
chenllliang / ParetoMNMT
View on GitHub
Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023
☆17Sep 27, 2023Updated 2 years ago
yanlinf / UXSenti
View on GitHub
Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)
☆10Nov 4, 2019Updated 6 years ago
siddk / deep-nlp
View on GitHub
Tensorflow Tutorial files and Implementations of various Deep NLP and CV Models.
☆47Oct 3, 2016Updated 9 years ago
yuanxiaosc / Deep_dynamic_contextualized_word_representation
View on GitHub
TensorFlow code and pre-trained models for A Dynamic Word Representation Model Based on Deep Context. It combines the idea of BERT model…
☆15Dec 27, 2018Updated 7 years ago
studio-ousia / textent
View on GitHub
Representation Learning of Entities and Documents from Knowledge Base Descriptions
☆18Oct 6, 2018Updated 7 years ago