TurkuNLP / wikibert
BERT models for many languages created from Wikipedia texts
☆33 Updated 4 years ago
Alternatives and similar repositories for wikibert:
Users who are interested in wikibert are comparing it to the repositories listed below.
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation" ☆40 Updated 6 years ago
- numeric fused-head identification and resolution ☆33 Updated 5 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference ☆62 Updated 2 years ago
- ☆30 Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Updated 3 years ago
- ☆24 Updated 5 years ago
- ☆17 Updated last year
- Dependency Parsing as Sequence Labeling ☆26 Updated 7 months ago
- A program to choose transfer languages for cross-lingual learning ☆72 Updated last year
- Converter from UD-trees to BART representation ☆36 Updated last year
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding. ☆50 Updated 3 years ago
- Post-editing Datasets by Rakuten (PEDRa) ☆14 Updated 3 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer ☆39 Updated 4 years ago
- ☆17 Updated 2 years ago
- Implementation of Nested Named Entity Recognition using Flair ☆24 Updated 3 years ago
- Survey on machine learning. ☆14 Updated 4 years ago
- ☆33 Updated 3 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform ☆17 Updated 4 years ago
- Code for the paper "Latent Relation Language Models" at AAAI-20. ☆41 Updated 4 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 Updated 3 years ago
- ☆29 Updated last year
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021 ☆33 Updated 3 years ago
- Statistics on multilingual datasets ☆17 Updated 2 years ago
- A simple neural truecaser written in pytorch and allennlp. ☆33 Updated 8 months ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant. ☆31 Updated 4 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition" ☆19 Updated 4 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 Updated 4 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18. ☆42 Updated 5 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c… ☆14 Updated 4 years ago
- Codebase for probing and visualizing multilingual models. ☆47 Updated 4 years ago