inspection-ai/japanese-toxic-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/inspection-ai/japanese-toxic-dataset)

inspection-ai / japanese-toxic-dataset

☆22

Alternatives and similar repositories for japanese-toxic-dataset

Users that are interested in japanese-toxic-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

megagonlabs / asdc
View on GitHub
Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)
☆25Jan 19, 2024Updated 2 years ago
ku-nlp / JMRD
View on GitHub
Japanese Movie Recommendation Dialogue dataset
☆29Jul 19, 2022Updated 4 years ago
aiishii / JEMHopQA
View on GitHub
☆30Apr 10, 2025Updated last year
singletongue / wikipedia-utils
View on GitHub
Utility scripts for preprocessing Wikipedia texts for NLP
☆78Apr 9, 2024Updated 2 years ago
tasukuigarashi / j-liwc2015
View on GitHub
Japanese version of LIWC2015
☆13Nov 6, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nobu-g / cohesion-analysis
View on GitHub
Code for COLING 2020 Paper
☆13Feb 3, 2026Updated 5 months ago
utanaka2000 / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆25Mar 16, 2021Updated 5 years ago
ku-nlp / ja-vicuna-qa-benchmark
View on GitHub
☆33Jul 31, 2024Updated last year
megagonlabs / ebe-dataset
View on GitHub
Evidence-based Explanation Dataset (AACL-IJCNLP 2020)
☆18Dec 17, 2020Updated 5 years ago
soramame0518 / j-mfd
View on GitHub
Japanese Moral Foundations Dictionary (J-MFD)
☆17Jan 12, 2022Updated 4 years ago
colorfulscoop / sbert-ja
View on GitHub
Code to train Sentence BERT Japanese model for Hugging Face Model Hub
☆11Aug 8, 2021Updated 4 years ago
masayu-a / WLSP-familiarity
View on GitHub
Word Familiarity Rate for 'Word List by Semantic Principles (WLSP)'
☆12Jan 2, 2025Updated last year
jqk09a / japanese-daily-dialogue
View on GitHub
☆60Mar 17, 2023Updated 3 years ago
DaisukeBekki / JSeM
View on GitHub
Japanese semantic test suite (FraCaS counterpart and extensions)
☆13Apr 21, 2026Updated 3 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
kunishou / do-not-answer-ja
View on GitHub
☆24Dec 15, 2023Updated 2 years ago
megagonlabs / instruction_ja
View on GitHub
Japanese instruction data (日本語指示データ)
☆24Jul 13, 2023Updated 3 years ago
HojiChar / HojiChar
View on GitHub
The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.
☆128Jul 17, 2026Updated last week
textlint-ja / technological-book-corpus-ja
View on GitHub
日本語で書かれた技術書を収集した生コーパス/ツール
☆26Apr 8, 2026Updated 3 months ago
WorksApplications / uzushio
View on GitHub
☆24Mar 18, 2026Updated 4 months ago
namhkoh / BAD-BiAs-Detection-in-LLMs
View on GitHub
BAD: BiAs Detection for Large Language Models in the context of candidate screening (EECS 692)
☆12Feb 14, 2024Updated 2 years ago
yahoojapan / JGLUE
View on GitHub
JGLUE: Japanese General Language Understanding Evaluation
☆346Mar 31, 2025Updated last year
tanreinama / Japanese-Fakenews-Dataset
View on GitHub
日本語フェイクニュースデータセット
☆21May 2, 2021Updated 5 years ago
osekilab / JCoLA
View on GitHub
☆19Apr 21, 2026Updated 3 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
megagonlabs / jrte-corpus
View on GitHub
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
☆77Jun 23, 2023Updated 3 years ago
ndl-lab / huriganacorpus-aozora
View on GitHub
青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット
☆22Jan 17, 2024Updated 2 years ago
kanekomasahiro / bias_eval_in_multiple_mlm
View on GitHub
☆11Jul 7, 2023Updated 3 years ago
cl-tohoku / keigo_transfer_task
View on GitHub
敬語変換タスクにおける評価用データセット
☆21Nov 24, 2022Updated 3 years ago
verypluming / JSICK
View on GitHub
Repository for JSICK
☆46May 31, 2023Updated 3 years ago
nlp-waseda / JMMLU
View on GitHub
日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark
☆40Oct 7, 2025Updated 9 months ago
CyberAgentAILab / camera
View on GitHub
Multimodal dataset for ad text generation in Japanese [Mita+, ACL2024]
☆26Aug 13, 2024Updated last year
Stability-AI / lm-evaluation-harness
View on GitHub
A framework for few-shot evaluation of autoregressive language models.
☆153Sep 13, 2024Updated last year
kondoumh / sb2md
View on GitHub
CLI to convert Scrapbox page to Markdown
☆12Jun 27, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
1never / open2ch-dialogue-corpus
View on GitHub
おーぷん2ちゃんねるをクロールして作成した対話コーパス
☆101Jun 6, 2021Updated 5 years ago
zuqqhi2 / docker-ml-python-sandbox
View on GitHub
Dockerfile for machine learning environment(scikit-learn, chainer, gensim, tensorflow, jupyter)
☆10Aug 16, 2018Updated 7 years ago
japanese-law-analysis / data_set
View on GitHub
法律・判例関係のデータセット
☆53Jan 8, 2025Updated last year
bzgeb / UnityUIDataBindingExample
View on GitHub
An example project to demonstrated UI Data Binding in Unity
☆13May 8, 2021Updated 5 years ago
verypluming / JaNLI
View on GitHub
☆17May 31, 2023Updated 3 years ago
osrg / optcast
View on GitHub
Reduction Server in Rust
☆14Apr 9, 2024Updated 2 years ago
iwiwi / epochraft-hf-fsdp
View on GitHub
Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP
☆11Jan 29, 2024Updated 2 years ago