Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages
☆15Apr 11, 2020Updated 5 years ago
Alternatives and similar repositories for embeddings
Users that are interested in embeddings are comparing it to the libraries listed below
Sorting:
- Code for "Bilingual Lexicon Induction with Semi-supervisionin Non-Isometric Embedding Spaces", ACL 2019☆14Sep 15, 2019Updated 6 years ago
- MultiLexNorm 2021 competition system from ÚFAL☆16Dec 30, 2021Updated 4 years ago
- ☆11Nov 14, 2021Updated 4 years ago
- A tool for text normalisation via character-level machine translation☆13Jun 12, 2020Updated 5 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Feb 1, 2020Updated 6 years ago
- NJUNMT for docNMT☆16Sep 9, 2020Updated 5 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Reference-free MT Evaluation Metrics☆20Sep 24, 2022Updated 3 years ago
- Aligned bilingual word vectors for English and Chinese☆11Jun 25, 2018Updated 7 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆24Aug 23, 2019Updated 6 years ago
- Python port of the Pink Trombone JS code☆18Jan 30, 2022Updated 4 years ago
- ☆10Sep 13, 2022Updated 3 years ago
- Code and resources for evaluating cross-lingual embedding spaces☆29Apr 7, 2020Updated 5 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆21Jan 17, 2020Updated 6 years ago
- ☆14May 15, 2020Updated 5 years ago
- ☆51Jul 25, 2024Updated last year
- ☆93Feb 13, 2024Updated 2 years ago
- The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025)☆12May 17, 2025Updated 10 months ago
- A series of BERT and Albert model checkpoints trained to reduce gendered correlations in pre-training☆11Oct 22, 2020Updated 5 years ago
- ☆10May 26, 2022Updated 3 years ago
- ISI tutorials☆12Oct 28, 2016Updated 9 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- DQN with freezing target network in tensorflow on pygame FlappyBird☆11Dec 19, 2018Updated 7 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Jan 4, 2021Updated 5 years ago
- Morphological analysis for Udmurt.☆12Feb 17, 2026Updated last month
- Source code accompanying the ICLR2020 publication 'Massively Multilingual Sparse Word Representations' https://openreview.net/forum?id=Hy…☆12Aug 15, 2023Updated 2 years ago
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Jan 25, 2026Updated last month
- [Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring☆11Dec 1, 2023Updated 2 years ago
- материалы курса по питону для студентов дпо-программы "компьютерная лингвистика" в НИУ ВШЭ (2020-2021)☆11Feb 21, 2022Updated 4 years ago
- Part-of-speech tagging using BERT☆10Nov 14, 2019Updated 6 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Oct 27, 2022Updated 3 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆107Nov 2, 2021Updated 4 years ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Dec 14, 2025Updated 3 months ago
- ☆15Apr 12, 2023Updated 2 years ago
- PyTorch implementation of the RCSLS cross-lingual word embedding alignment method☆12May 1, 2019Updated 6 years ago
- ☆13Jul 26, 2023Updated 2 years ago