Codebase for probing and visualizing multilingual models.
☆49May 13, 2020Updated 5 years ago
Alternatives and similar repositories for multilingual-probing-visualization
Users that are interested in multilingual-probing-visualization are comparing it to the libraries listed below
Sorting:
- Analyzing mBERT's multilinguality in a small laboratory setting☆13Jun 12, 2023Updated 2 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Sep 13, 2023Updated 2 years ago
- ☆13Oct 3, 2024Updated last year
- decontamination☆26Mar 4, 2026Updated 2 weeks ago
- Code used for the paper "Linguistic Features for Readability Assessment" (Deutsch, Jasbi, and Shieber 2020)☆25Jul 19, 2021Updated 4 years ago
- ☆14Apr 8, 2021Updated 4 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆18Aug 17, 2021Updated 4 years ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- A python 3 interface for BabelNet https://babelnet.org/☆33Jan 27, 2023Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- Repo for the Unified Verbs Index Project☆12Feb 3, 2026Updated last month
- Source code for ACL2020: On the Robustness of Language Encoders against Grammatical Errors☆10Jul 6, 2023Updated 2 years ago
- A tiny BERT for low-resource monolingual models☆31Dec 24, 2025Updated 2 months ago
- Code Repository for "A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models".☆15Oct 14, 2022Updated 3 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- This repo includes our code for evaluating and improving transferability in domain generalization (NeurIPS 2021)☆13Nov 1, 2022Updated 3 years ago
- PyTorch implementation of the RCSLS cross-lingual word embedding alignment method☆12May 1, 2019Updated 6 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Jun 17, 2024Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656☆25Jun 12, 2023Updated 2 years ago
- Syntactic evaluation sets, attribute-varying grammars, and code for replicating the CLAMS paper. ACL 2020.☆17Nov 26, 2024Updated last year
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- Data Collection System For NLP/Speech Recognition☆25Apr 20, 2021Updated 4 years ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- ☆16Dec 14, 2022Updated 3 years ago
- ☆22Apr 13, 2018Updated 7 years ago
- Detect individual instruments activity in an audio file. 🎤🎹🎸🥁☆16Jun 29, 2021Updated 4 years ago
- ☆14Dec 3, 2019Updated 6 years ago
- NER bot 2.0☆12Jul 27, 2021Updated 4 years ago
- Implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"☆17Jan 10, 2022Updated 4 years ago
- Code and Results for "Universals of word order reflect optimization of grammars for efficient communication"☆14Aug 5, 2022Updated 3 years ago
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 8 years ago
- saved models for spleeter (tf and tfjs)☆16Jan 30, 2020Updated 6 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Dec 23, 2019Updated 6 years ago
- Lot Of Indic Tweets☆13Oct 4, 2019Updated 6 years ago
- Exploring semantic similarities between contextualized embeddings☆14May 18, 2021Updated 4 years ago