Train bilingual embeddings as described in our NAACL 2015 workshop paper "Bilingual Word Representations with Monolingual Quality in Mind". Besides, it has all the functionalities of word2vec with added features and code clarity. See README for more info.
☆79Jun 15, 2019Updated 6 years ago
Alternatives and similar repositories for bivec
Users that are interested in bivec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cross-lingual Dependency Parsing Based on Distributed Representations☆20Mar 2, 2018Updated 8 years ago
- BiCVM Code☆45May 14, 2018Updated 7 years ago
- A Multilingual and Multilevel Representation Learning Toolkit for NLP☆117Feb 14, 2018Updated 8 years ago
- Open-source implementation of the BilBOWA (Bilingual Bag-of-Words without Alignments) word embedding model.☆69Jul 28, 2021Updated 4 years ago
- scripts and data for ACL 16 paper☆14Jul 5, 2016Updated 9 years ago
- Geometry-aware Multilingual Embeddings☆26Dec 8, 2022Updated 3 years ago
- ☆56Aug 21, 2018Updated 7 years ago
- A framework to learn cross-lingual word embedding mappings☆654Apr 22, 2023Updated 2 years ago
- Graph-based Dependency Parser☆46Jan 25, 2016Updated 10 years ago
- Crosslingual word embeddings described in our EMNLP paper☆16Sep 21, 2016Updated 9 years ago
- Code for "Unsupervised Cross-lingual Transfer of Word Embedding Spaces" in EMNLP 2018☆24Dec 29, 2018Updated 7 years ago
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- CS224S Course Project☆14Jun 9, 2014Updated 11 years ago
- Lab exercises for the DL4MT winter school at DCU☆15Oct 21, 2015Updated 10 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Feb 25, 2015Updated 11 years ago
- Expletives vomiting library...☆13Apr 17, 2017Updated 8 years ago
- ☆14May 14, 2019Updated 6 years ago
- Bilingual sentence aligner (Gale & Church, 1993)☆14Jan 8, 2026Updated 2 months ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Oct 1, 2015Updated 10 years ago
- [NAACL 2018] Robust Sequence Labeling with Adversarial Training☆10Sep 30, 2019Updated 6 years ago
- Unsupervised Neural Machine Translation☆475Jul 8, 2020Updated 5 years ago
- This is a sample code for AutoSimulTrans Workshop (https://autosimtrans.github.io)☆18Dec 25, 2020Updated 5 years ago
- Parsito: Fast non-projective transition-based dependency parser☆14Nov 24, 2025Updated 4 months ago
- ☆21Apr 4, 2015Updated 10 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Dec 8, 2022Updated 3 years ago
- An extension of word2vec to learn phrase embeddings☆76Oct 24, 2018Updated 7 years ago
- Sense Disambiguation of Connectives for PDTB-Style Discourse Parsing☆14Jan 13, 2017Updated 9 years ago
- Retrofitting Word Vectors to Semantic Lexicons☆375Apr 9, 2019Updated 6 years ago
- Python code for training Paragram word embeddings. These achieve human-level performance on some word similiarty tasks including SimLex-9…☆30Feb 4, 2016Updated 10 years ago
- Semanticizest: dump parser and client☆20May 11, 2016Updated 9 years ago
- Top-Down BTG-based Preordering☆16Jan 14, 2016Updated 10 years ago
- Resources for the OpenNMT hackathon☆51May 24, 2019Updated 6 years ago
- C++ implementation of the Hellinger PCA for computing word embeddings.☆32Nov 11, 2016Updated 9 years ago
- Cynical data selection☆20Jan 16, 2021Updated 5 years ago
- Implementation of a deep recursive net over binary parse trees (code for NIPS2014 paper)☆28Feb 6, 2015Updated 11 years ago
- A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection☆61May 29, 2017Updated 8 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks.☆119Mar 26, 2021Updated 4 years ago
- SOTA TAG Parser☆15Jan 19, 2019Updated 7 years ago
- A memory-based morphological parser for Python☆16Oct 12, 2012Updated 13 years ago