Train bilingual embeddings as described in our NAACL 2015 workshop paper "Bilingual Word Representations with Monolingual Quality in Mind". Besides, it has all the functionalities of word2vec with added features and code clarity. See README for more info.
☆79Jun 15, 2019Updated 6 years ago
Alternatives and similar repositories for bivec
Users that are interested in bivec are comparing it to the libraries listed below
Sorting:
- Cross-lingual Dependency Parsing Based on Distributed Representations☆20Mar 2, 2018Updated 8 years ago
- BiCVM Code☆45May 14, 2018Updated 7 years ago
- A Multilingual and Multilevel Representation Learning Toolkit for NLP☆117Feb 14, 2018Updated 8 years ago
- Open-source implementation of the BilBOWA (Bilingual Bag-of-Words without Alignments) word embedding model.☆69Jul 28, 2021Updated 4 years ago
- scripts and data for ACL 16 paper☆14Jul 5, 2016Updated 9 years ago
- CS224S Course Project☆14Jun 9, 2014Updated 11 years ago
- ☆56Aug 21, 2018Updated 7 years ago
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- Graph-based Dependency Parser☆46Jan 25, 2016Updated 10 years ago
- Lab exercises for the DL4MT winter school at DCU☆15Oct 21, 2015Updated 10 years ago
- A framework to learn cross-lingual word embedding mappings☆654Apr 22, 2023Updated 2 years ago
- Expletives vomiting library...☆13Apr 17, 2017Updated 8 years ago
- Python code for training Paragram word embeddings. These achieve human-level performance on some word similiarty tasks including SimLex-9…☆30Feb 4, 2016Updated 10 years ago
- C++ implementation of the Hellinger PCA for computing word embeddings.☆32Nov 11, 2016Updated 9 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Oct 1, 2015Updated 10 years ago
- McKernel: A Library for Approximate Kernel Expansions in Log-linear Time.☆13Sep 3, 2022Updated 3 years ago
- Geometry-aware Multilingual Embeddings☆26Dec 8, 2022Updated 3 years ago
- a latex cheat sheet with ipython commands and shortcuts☆10Mar 10, 2014Updated 11 years ago
- Bilingual sentence aligner (Gale & Church, 1993)☆14Jan 8, 2026Updated last month
- Code for "Unsupervised Cross-lingual Transfer of Word Embedding Spaces" in EMNLP 2018☆24Dec 29, 2018Updated 7 years ago
- ☆25Feb 21, 2019Updated 7 years ago
- Sense Disambiguation of Connectives for PDTB-Style Discourse Parsing☆14Jan 13, 2017Updated 9 years ago
- Retrofitting Word Vectors to Semantic Lexicons☆375Apr 9, 2019Updated 6 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Feb 25, 2015Updated 11 years ago
- A toolkit for social media information extraction using multi-task learning and active learning☆19Dec 27, 2022Updated 3 years ago
- Cynical data selection☆20Jan 16, 2021Updated 5 years ago
- Parsito: Fast non-projective transition-based dependency parser☆14Nov 24, 2025Updated 3 months ago
- Unsupervised Neural Machine Translation☆475Jul 8, 2020Updated 5 years ago
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings☆53Dec 6, 2016Updated 9 years ago
- Semanticizest: dump parser and client☆20May 11, 2016Updated 9 years ago
- maximum entropy based part-of-speech tagger for NLTK☆45Dec 8, 2016Updated 9 years ago
- ☆14May 14, 2019Updated 6 years ago
- Simple, fast unsupervised word aligner☆767Jul 19, 2022Updated 3 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Dec 8, 2022Updated 3 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks.☆119Mar 26, 2021Updated 4 years ago
- Gromov-Wasserstein Alignment of Embeddings☆68Sep 23, 2021Updated 4 years ago
- Moro files for the ACL 2015 Tutorial on Matrix and Tensor Factorization Methods for Natural Language Processing☆20Jul 29, 2015Updated 10 years ago
- Transition-based dependency parser based on stack LSTMs☆206Nov 17, 2019Updated 6 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Aug 10, 2023Updated 2 years ago