venali / BilingualCorpus
☆22Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for BilingualCorpus
- A large parallel corpus of English and Japanese☆79Updated 7 years ago
- Keyaki Treebank Parsed Corpus☆10Updated 5 years ago
- A phenomenon-wise evaluation dataset for Japanese-English machine translation robustness. The dataset is based on the MTNT dataset, with …☆14Updated 3 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018)☆77Updated 5 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆39Updated 5 years ago
- ⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy☆33Updated 2 years ago
- BERT-based GEC tagging for Japanese☆16Updated last year
- ☆34Updated 3 years ago
- Python package for understanding the difficulty of text classification datasets. (in CoNNL 2018)☆63Updated 3 years ago
- An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.☆103Updated 3 years ago
- Codes to pre-train Japanese T5 models☆40Updated 3 years ago
- Darts-clone python binding☆20Updated 2 years ago
- 単語分割を経由しない単語埋め込み☆14Updated 7 years ago
- Japanese BERT trained on Aozora Bunko and Wikipedia, pre-tokenized by MeCab with UniDic & SudachiPy☆40Updated 4 years ago
- Japanese data from the Google UDT 2.0.☆36Updated last week
- Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.☆90Updated last year
- 本リポジトリは「AllenNLP入門」のソースコード置き場です。☆37Updated last year
- A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus☆10Updated 4 months ago
- Character Based Named Entity Recognition.☆41Updated 6 years ago
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Updated 3 years ago
- Capturing Set-Theoretic Semantics of Words using Box Embeddings☆34Updated 2 years ago
- Japanese data from the Google UDT 2.0.☆28Updated last year
- The Business Scene Dialogue corpus☆68Updated 3 years ago
- Direct Attentive Dependency Parser☆51Updated 8 months ago
- NIILC QA data☆17Updated 9 years ago
- ☆46Updated 2 years ago
- A simple implementation of SimCSE☆74Updated 2 years ago
- Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.☆85Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆132Updated last year
- Dataset for the Emerging & Novel Entity NER task (WNUT '17)☆111Updated 2 years ago