MilaNLProc / language-invariant-properties
☆22Updated 3 years ago
Alternatives and similar repositories for language-invariant-properties:
Users that are interested in language-invariant-properties are comparing it to the libraries listed below
- Source code and data for Like a Good Nearest Neighbor☆28Updated 3 months ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Updated 3 years ago
- ☆11Updated 4 months ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- codebase for the Text-based NP Enrichment (TNE) paper☆20Updated last year
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Updated 3 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆86Updated this week
- Embedding Recycling for Language models☆38Updated last year
- ☆21Updated 3 years ago
- ☆22Updated 2 months ago
- ☆19Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 3 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆16Updated 3 weeks ago
- Combining encoder-based language models☆11Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆19Updated 2 months ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Updated last month
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆45Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated last year
- A simple neural truecaser written in pytorch and allennlp.☆33Updated 9 months ago
- Semantically Structured Sentence Embeddings☆65Updated 5 months ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14Updated 4 years ago