Cinnamon / electra_japanese

☆49

Related projects: ⓘ

megagonlabs / jrte-corpus
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
☆75Updated last year
nekoumei / DocumentClassificationUsingBERT-Japanese
☆40Updated 3 years ago
hottolink / hottoSNS-bert
hottoSNS-BERT: 大規模SNSコーパスによる文分散表現モデル
☆60Updated 3 years ago
chakki-works / Japanese-Company-Lexicon
☆94Updated last year
kajyuuen / daaja
This repository has implementations of data augmentation for NLP for Japanese.
☆63Updated last year
himkt / awesome-bert-japanese
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
☆129Updated last year
Kosuke-Szk / ja_text_bert
日本語WikipediaコーパスでBERTのPre-Trainedモデルを生成するためのリポジトリ
☆115Updated 5 years ago
stockmarkteam / ner-wikipedia-dataset
Wikipediaを用いた日本語の固有表現抽出データセット
☆132Updated last year
WorksApplications / chikkarpy
Japanese synonym library
☆51Updated 2 years ago
cl-tohoku / JAQKET_baseline
☆16Updated 2 years ago
1never / open2ch-dialogue-corpus
おーぷん2ちゃんねるをクロールして作成した対話コーパス
☆93Updated 3 years ago
wwwcojp / ja_sentence_segmenter
japanese sentence segmentation library for python
☆65Updated last year
upura / nlp-recipes-ja
Samples codes for natural language processing in Japanese
☆63Updated last year
Katsumata420 / wikihow_japanese
☆35Updated 3 years ago
ikuyamada / wikipedia-nlp
Sample code for natural language processing using Wikipedia
☆19Updated 5 years ago
yagays / ja-timex
自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器
☆132Updated 7 months ago
KodairaTomonori / ThreeLineSummaryDataset
☆30Updated 6 years ago
osuossu8 / Utils
☆33Updated 3 years ago
WorksApplications / SudachiTra
Japanese tokenizer for Transformers
☆77Updated 9 months ago
megagonlabs / UD_Japanese-GSD
Japanese data from the Google UDT 2.0.
☆28Updated last year
sonoisa / t5-japanese
日本語T5モデル
☆111Updated last year
taishi-i / toiro
A comparison tool of Japanese tokenizers
☆117Updated 3 months ago
halhorn / deep_dialog_tutorial
tutorial for deep learning dialogue models
☆75Updated last year
chakki-works / chABSA-dataset
chakki's Aspect-Based Sentiment Analysis dataset
☆136Updated 2 years ago
yagays / pytorch_bert_japanese
☆34Updated 4 years ago
nn116003 / torchtext-tutorial
torchtext-tutorial (text classification)
☆32Updated 6 years ago
yagays / nayose-wikipedia-ja
Wikipediaから作成した日本語名寄せデータセット
☆34Updated 4 years ago
sonoisa / sentence-transformers
Sentence Embeddings with BERT & XLNet
☆32Updated last year
ku-nlp / pyknp
A Python Module for JUMAN++/KNP
☆88Updated 2 months ago
Hironsan / ja.text8
Japanese text8 corpus for word embedding.
☆109Updated 6 years ago