alinear-corp/albert-japanese

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alinear-corp/albert-japanese)

alinear-corp / albert-japanese

BERT with SentencePiece for Japanese text.

☆33

Alternatives and similar repositories for albert-japanese

Users that are interested in albert-japanese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sonoisa / sentence-transformers
View on GitHub
Sentence Embeddings with BERT & XLNet
☆32Aug 25, 2023Updated 2 years ago
megagonlabs / ebe-dataset
View on GitHub
Evidence-based Explanation Dataset (AACL-IJCNLP 2020)
☆18Dec 17, 2020Updated 5 years ago
WorksApplications / uzushio
View on GitHub
☆24Mar 18, 2026Updated 4 months ago
colorfulscoop / sbert-ja
View on GitHub
Code to train Sentence BERT Japanese model for Hugging Face Model Hub
☆11Aug 8, 2021Updated 4 years ago
yoheikikuta / bert-japanese
View on GitHub
BERT with SentencePiece for Japanese text.
☆498Feb 15, 2021Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
megagonlabs / UD_Japanese-GSD
View on GitHub
Japanese data from the Google UDT 2.0.
☆28Mar 24, 2023Updated 3 years ago
megagonlabs / jrte-corpus
View on GitHub
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
☆77Jun 23, 2023Updated 3 years ago
yagays / nayose-wikipedia-ja
View on GitHub
Wikipediaから作成した日本語名寄せデータセット
☆35Mar 10, 2020Updated 6 years ago
yahoojapan / VFD-Dataset
View on GitHub
☆11Nov 10, 2020Updated 5 years ago
yukiB / rnntest
View on GitHub
Simple RNN test with TensorFLow
☆13May 22, 2018Updated 8 years ago
6 / kaomoji-json
View on GitHub
4000+ annotated 顔文字 (kaomoji) in JSON (UTF-8 & ShiftJIS)ヽ(`Д´*)ﾉ
☆27Jul 11, 2014Updated 12 years ago
kanjirz50 / termextract
View on GitHub
専門用語抽出アルゴリズムの実装の練習
☆18Sep 26, 2018Updated 7 years ago
cl-tohoku / bert-japanese
View on GitHub
BERT models for Japanese text.
☆551Mar 23, 2024Updated 2 years ago
megagonlabs / ginza-transformers
View on GitHub
Use custom tokenizers in spacy-transformers
☆16Aug 9, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
apsdehal / flava-tutorials
View on GitHub
Tutorials for FLAVA model https://arxiv.org/abs/2112.04482
☆12Jun 22, 2022Updated 4 years ago
danielvarab / massive-summ
View on GitHub
☆31Apr 21, 2023Updated 3 years ago
verypluming / JaNLI
View on GitHub
☆17May 31, 2023Updated 3 years ago
aiishii / JEMHopQA
View on GitHub
☆30Apr 10, 2025Updated last year
SkelterLabsInc / JaQuAD
View on GitHub
JaQuAD: Japanese Question Answering Dataset for Machine Reading Comprehension (2022, Skelter Labs)
☆111Mar 2, 2022Updated 4 years ago
kenta1984 / wrd
View on GitHub
☆23Sep 18, 2020Updated 5 years ago
kenkov / cabocha
View on GitHub
CaboCha wrapper for Python3
☆46Jul 5, 2018Updated 8 years ago
akirakubo / bert-japanese-aozora
View on GitHub
Japanese BERT trained on Aozora Bunko and Wikipedia, pre-tokenized by MeCab with UniDic & SudachiPy
☆40Aug 8, 2020Updated 5 years ago
Hironsan / IOB2Corpus
View on GitHub
Japanese IOB2 tagged corpus for Named Entity Recognition.
☆61Feb 25, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apple / ml-vfi-smiff
View on GitHub
☆14Nov 5, 2025Updated 8 months ago
megagonlabs / instruction_ja
View on GitHub
Japanese instruction data (日本語指示データ)
☆24Jul 13, 2023Updated 3 years ago
Hironsan / ja.text8
View on GitHub
Japanese text8 corpus for word embedding.
☆111Oct 4, 2017Updated 8 years ago
nouu-me / document_vector_search_benchmark
View on GitHub
Benchmark for Japanese document embedding & vector search
☆29Mar 12, 2024Updated 2 years ago
bigcode-project / opt-out-v2
View on GitHub
Repository for opt-out requests.
☆12Updated this week
bokuweb / node-lcs-img-diff
View on GitHub
🖼 Image diff tool with LCS algorithm for Node.js
☆15Dec 15, 2023Updated 2 years ago
yagays / embedrank
View on GitHub
Python Implementation of EmbedRank
☆48Mar 19, 2019Updated 7 years ago
polm / unidic-lite
View on GitHub
A small version of UniDic for easy pip installs.
☆52Sep 1, 2020Updated 5 years ago
BruntonUWBio / ajile12-nwb-data
View on GitHub
☆15Feb 18, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SeanNobel / speech-decoding
View on GitHub
Reimplementation of speech decoding 2022 paper by MetaAI
☆14Oct 17, 2023Updated 2 years ago
jiefisher / matcher
View on GitHub
rule matcher (context free grammar)
☆10Dec 27, 2019Updated 6 years ago
newline-sandbox / vue-apollo-graphql
View on GitHub
A simple GitHub search client built with Vue 3 and Apollo.
☆11Mar 5, 2021Updated 5 years ago
nobu-g / cohesion-analysis
View on GitHub
Code for COLING 2020 Paper
☆13Feb 3, 2026Updated 5 months ago
ujiuji1259 / shinra-attribute-extraction
View on GitHub
☆11Sep 7, 2021Updated 4 years ago
windsuzu / Joint-Semantic-Phonetic-Embedding
View on GitHub
We use phonetics as a feature to create a joint semantic-phonetic embedding and improve the neural machine translation between Chinese an…
☆12Aug 3, 2021Updated 4 years ago
k-takano0423 / BiClass-Definition-Generator
View on GitHub
☆11Oct 20, 2024Updated last year