utanaka2000/fairseq

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/utanaka2000/fairseq)

utanaka2000 / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

☆25

Alternatives and similar repositories for fairseq

Users that are interested in fairseq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

megagonlabs / t5-japanese
View on GitHub
Codes to pre-train Japanese T5 models
☆40Sep 7, 2021Updated 4 years ago
megagonlabs / ebe-dataset
View on GitHub
Evidence-based Explanation Dataset (AACL-IJCNLP 2020)
☆18Dec 17, 2020Updated 5 years ago
Katsumata420 / wikihow_japanese
View on GitHub
☆35Dec 17, 2020Updated 5 years ago
inspection-ai / japanese-toxic-dataset
View on GitHub
☆22Jan 11, 2023Updated 3 years ago
KoichiYasuoka / SuPar-UniDic
View on GitHub
Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models
☆21Feb 28, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ku-nlp / JMRD
View on GitHub
Japanese Movie Recommendation Dialogue dataset
☆29Jul 19, 2022Updated 4 years ago
singletongue / wikipedia-utils
View on GitHub
Utility scripts for preprocessing Wikipedia texts for NLP
☆78Apr 9, 2024Updated 2 years ago
nict-wisdom / rannc
View on GitHub
RaNNC is an automatic parallelization middleware used to train very large-scale neural networks.
☆57Oct 15, 2022Updated 3 years ago
yamaru12345 / nlp100
View on GitHub
言語処理100本ノック 2020
☆30Nov 19, 2020Updated 5 years ago
megagonlabs / asdc
View on GitHub
Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)
☆25Jan 19, 2024Updated 2 years ago
nobu-g / cohesion-analysis
View on GitHub
Code for COLING 2020 Paper
☆13Feb 3, 2026Updated 5 months ago
megagonlabs / jrte-corpus
View on GitHub
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
☆77Jun 23, 2023Updated 3 years ago
KodairaTomonori / ThreeLineSummaryDataset
View on GitHub
☆31Apr 4, 2018Updated 8 years ago
aiishii / JEMHopQA
View on GitHub
☆30Apr 10, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
colorfulscoop / sbert-ja
View on GitHub
Code to train Sentence BERT Japanese model for Hugging Face Model Hub
☆11Aug 8, 2021Updated 4 years ago
DaisukeBekki / JSeM
View on GitHub
Japanese semantic test suite (FraCaS counterpart and extensions)
☆13Apr 21, 2026Updated 3 months ago
retarfi / language-pretraining
View on GitHub
Pre-training Language Models for Japanese
☆50Jul 2, 2023Updated 3 years ago
kzinmr / transformers_ner_ja
View on GitHub
Japanese NER with Transformers + PyTorch-Lightning + MLflow Tracking
☆15Nov 20, 2022Updated 3 years ago
yahoojapan / VFD-Dataset
View on GitHub
☆11Nov 10, 2020Updated 5 years ago
tanreinama / gpt2-japanese
View on GitHub
Japanese GPT2 Generation Model
☆323Sep 2, 2023Updated 2 years ago
ids-cv / wrime
View on GitHub
☆177Sep 11, 2025Updated 10 months ago
ou-medinfo / medbertjp
View on GitHub
Trials of pre-trained BERT models for the medical domain in Japanese.
☆13Nov 21, 2020Updated 5 years ago
WorksApplications / uzushio
View on GitHub
☆24Mar 18, 2026Updated 4 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ikuyamada / wikipedia-nlp
View on GitHub
Sample code for natural language processing using Wikipedia
☆19Oct 23, 2018Updated 7 years ago
sonoisa / t5-japanese
View on GitHub
日本語T5モデル
☆118Sep 15, 2025Updated 10 months ago
ku-nlp / AnnotatedFKCCorpus
View on GitHub
Annotated Fuman Kaitori Center Corpus
☆18Dec 18, 2023Updated 2 years ago
osekilab / JCoLA
View on GitHub
☆19Apr 21, 2026Updated 3 months ago
natsuakane / Yet
View on GitHub
☆14Sep 15, 2025Updated 10 months ago
ku-nlp / VISA
View on GitHub
An ambiguous subtitles dataset for visual scene-aware machine translation
☆14Oct 17, 2022Updated 3 years ago
cl-tohoku / keigo_transfer_task
View on GitHub
敬語変換タスクにおける評価用データセット
☆21Nov 24, 2022Updated 3 years ago
shimo-lab / sembei
View on GitHub
単語分割を経由しない単語埋め込み
☆14Mar 19, 2017Updated 9 years ago
ndl-lab / huriganacorpus-aozora
View on GitHub
青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット
☆22Jan 17, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
informatix-inc / bert
View on GitHub
☆28Apr 5, 2022Updated 4 years ago
nttcslab / japanese-dialog-transformers
View on GitHub
Code for evaluating Japanese pretrained models provided by NTT Ltd.
☆246Jun 21, 2023Updated 3 years ago
yahoojapan / JGLUE
View on GitHub
JGLUE: Japanese General Language Understanding Evaluation
☆346Mar 31, 2025Updated last year
CyberAgentAILab / camera
View on GitHub
Multimodal dataset for ad text generation in Japanese [Mita+, ACL2024]
☆26Aug 13, 2024Updated last year
akirakubo / bert-japanese-aozora
View on GitHub
Japanese BERT trained on Aozora Bunko and Wikipedia, pre-tokenized by MeCab with UniDic & SudachiPy
☆40Aug 8, 2020Updated 5 years ago
Language-Media-Lab / commonsense-moral-ja
View on GitHub
☆15Nov 20, 2025Updated 8 months ago
verypluming / JaNLI
View on GitHub
☆17May 31, 2023Updated 3 years ago