antonisa/embeddings

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/antonisa/embeddings)

antonisa / embeddings

Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages

☆15

Alternatives and similar repositories for embeddings

Users that are interested in embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ehudreiter / accuracySharedTask
View on GitHub
Shared task on evaluating accuracy
☆16Sep 22, 2021Updated 4 years ago
ufal / multilexnorm2021
View on GitHub
MultiLexNorm 2021 competition system from ÚFAL
☆16Dec 30, 2021Updated 4 years ago
ruathudo / post-ocr-correction
View on GitHub
☆11Nov 14, 2021Updated 4 years ago
clarinsi / csmtiser
View on GitHub
A tool for text normalisation via character-level machine translation
☆13Jun 12, 2020Updated 6 years ago
ltgoslo / simple_elmo_training
View on GitHub
Minimal code to train ELMo models in recent versions of TensorFlow
☆14Jun 16, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
stickeritis / sticker2
View on GitHub
Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot
☆13Dec 18, 2020Updated 5 years ago
Blickwinkel1107 / making-the-most-of-context-nmt
View on GitHub
NJUNMT for docNMT
☆16Sep 9, 2020Updated 5 years ago
UKPLab / naacl2019-does-my-rebuttal-matter
View on GitHub
☆28Jul 29, 2023Updated 2 years ago
lenakmeth / Wikinflection-Corpus
View on GitHub
The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…
☆12Dec 15, 2023Updated 2 years ago
AIPHES / ACL20-Reference-Free-MT-Evaluation
View on GitHub
Reference-free MT Evaluation Metrics
☆20Sep 24, 2022Updated 3 years ago
AlexeySorokin / NeuralMorphemeSegmentation
View on GitHub
Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language
☆25Aug 23, 2019Updated 6 years ago
kmike / dialog2017
View on GitHub
☆10Jul 21, 2017Updated 8 years ago
tatHi / maxmatch_dropout
View on GitHub
☆10Sep 13, 2022Updated 3 years ago
codogogo / xling-eval
View on GitHub
Code and resources for evaluating cross-lingual embedding spaces
☆29Apr 7, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mfaruqui / morph-trans
View on GitHub
Code for morphological transformations
☆29Jun 3, 2017Updated 9 years ago
antonisa / inflection
View on GitHub
Morphological Inflection for Low-Resource Languages using cross-lingual transfer
☆21Jan 17, 2020Updated 6 years ago
rewicks / ersatz
View on GitHub
☆51Jul 25, 2024Updated last year
pedrada88 / crossembeddings-twitter
View on GitHub
☆14May 15, 2020Updated 6 years ago
THUKElab / CLEME
View on GitHub
The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025)
☆12May 17, 2025Updated last year
google-research-datasets / Zari
View on GitHub
A series of BERT and Albert model checkpoints trained to reduce gendered correlations in pre-training
☆11Oct 22, 2020Updated 5 years ago
gmichalo / LexSubCon
View on GitHub
☆10May 26, 2022Updated 4 years ago
isi-nlp / tutorials
View on GitHub
ISI tutorials
☆12Oct 28, 2016Updated 9 years ago
spraakbanken / multiged-2023
View on GitHub
☆15Apr 12, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
cisnlp / parcoure
View on GitHub
ParCourE - Parallel Corpus Explorer
☆12Dec 27, 2021Updated 4 years ago
MSR-LIT / MultilingualBias
View on GitHub
☆10Jul 6, 2023Updated 3 years ago
GaryYufei / ACL2021MF
View on GitHub
Source Code For ACL 2021 Paper "Mention Flags (MF): Constraining Transformer-based Text Generators"
☆20Oct 4, 2021Updated 4 years ago
initial-h / FlappyBird_DQN_with_target_network
View on GitHub
DQN with freezing target network in tensorflow on pygame FlappyBird
☆11Dec 19, 2018Updated 7 years ago
passing2961 / DialogCC
View on GitHub
Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…
☆13Jun 24, 2024Updated 2 years ago
SapienzaNLP / clubert
View on GitHub
Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.
☆10Jan 4, 2021Updated 5 years ago
THUKElab / MixEdit
View on GitHub
The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"
☆12Nov 25, 2023Updated 2 years ago
timarkh / uniparser-grammar-udm
View on GitHub
Morphological analysis for Udmurt.
☆12May 23, 2026Updated last month
INK-USC / expl-refinement
View on GitHub
Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)
☆11Oct 25, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
begab / mamus
View on GitHub
Source code accompanying the ICLR2020 publication 'Massively Multilingual Sparse Word Representations' https://openreview.net/forum?id=Hy…
☆12Aug 15, 2023Updated 2 years ago
anklowait / python_for_CL
View on GitHub
материалы курса по питону для студентов дпо-программы "компьютерная лингвистика" в НИУ ВШЭ (2020-2021)
☆12Feb 21, 2022Updated 4 years ago
carina-kauf / better-mlm-scoring
View on GitHub
[Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring
☆12Dec 1, 2023Updated 2 years ago
spyysalo / bert-pos
View on GitHub
Part-of-speech tagging using BERT
☆10Nov 14, 2019Updated 6 years ago
lilt / alignment-scripts
View on GitHub
Scripts to preprocess training and test data and to run fast_align and giza
☆107Nov 2, 2021Updated 4 years ago
ehsanasgari / 1000Langs
View on GitHub
Creating super-parallel corpora of more than 1500+ unique languages for NLP research
☆33Dec 8, 2022Updated 3 years ago
phalodi / Email_Spam_Spark
View on GitHub
In this small project we will predict the email that in which folder it will go in spam or primary.
☆11Jul 5, 2016Updated 10 years ago