ndl-lab/huriganacorpus-ndlbib

A furigana dataset created from Japanese national bibliography data

☆32

Alternatives and similar repositories for huriganacorpus-ndlbib

Users who are interested in huriganacorpus-ndlbib are comparing it to the repositories listed below.

ndl-lab / huriganacorpus-aozora
A furigana corpus dataset created from Aozora Bunko and Sapie braille data
☆22 · Jan 17, 2024 · Updated 2 years ago
noir55 / voicevox_cli_client
Command-line client for VOICEVOX ENGINE, VOICEVOX NEMO ENGINE, and COEIROINK. Also supports parallel processing across multiple engines.
☆11 · May 4, 2024 · Updated 2 years ago
tanreinama / Japanese-BPEEncoder
Japanese-BPEEncoder
☆42 · Sep 12, 2021 · Updated 4 years ago
solaoi / voicepeaky4gpt
A server for using voicepeak as an API. Using OpenAI's API, it automatically adjusts emotion, voice pitch, and speed.
☆19 · Nov 12, 2023 · Updated 2 years ago
Japanese-Accent-Circle / japanese-accent
☆12 · Jan 11, 2022 · Updated 4 years ago
ku-nlp / AnnotatedFKCCorpus
Annotated Fuman Kaitori Center Corpus
☆18 · Dec 18, 2023 · Updated 2 years ago
yamachu / julius4seg
A segmentation support tool using Julius
☆14 · Feb 13, 2020 · Updated 6 years ago
solaoi / voicepeaky
A server for using voicepeak as an API.
☆28 · Nov 12, 2023 · Updated 2 years ago
daac-tools / vaporetto
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
☆297 · Jul 20, 2026 · Updated last week
azooKey / AJIMEE-Bench
AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)
☆23 · Jan 13, 2025 · Updated last year
rahmanidashti / pretrain-lightfm
Pre-train Embedding in LightFM Recommender System Framework
☆11 · Apr 28, 2019 · Updated 7 years ago
ndl-lab / tugidigi-web
Source code of the Next Digital Library (次世代デジタルライブラリー)
☆26 · Apr 30, 2026 · Updated 2 months ago
shi3z / BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch
☆20 · Mar 2, 2024 · Updated 2 years ago
himkt / awesome-bert-japanese
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
☆132 · Mar 15, 2023 · Updated 3 years ago
ndl-lab / dataprocessingforpdf
A set of tools for extracting text data from PDFs for use in machine learning and similar applications
☆12 · Aug 4, 2021 · Updated 4 years ago
shogo82148 / TinySegmenterMaker
☆73 · Sep 30, 2022 · Updated 3 years ago
ku-nlp / WikipediaAnnotatedCorpus
☆30 · Jul 1, 2026 · Updated 3 weeks ago
Yukaryavka / rinna_gpt-neox_ggml-lora
The repository contains scripts and merge scripts that have been modified to adapt an Alpaca-Lora adapter for LoRA tuning when assuming t…
☆19 · May 24, 2023 · Updated 3 years ago
joisino / chainer-ETTTS
This is an implementation of "Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention" wit…
☆28 · Dec 23, 2017 · Updated 8 years ago
Stability-AI / gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆13 · Jun 7, 2023 · Updated 3 years ago
HojiChar / HojiChar
The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.
☆128 · Jul 17, 2026 · Updated last week
kenall-inc / jntajis-python
A fast character conversion and transliteration library based on the scheme defined for Japan National Tax Agency (国税庁)'s corporate numb…
☆21 · Mar 11, 2026 · Updated 4 months ago
ndl-lab / layout-dataset
NDL-DocL dataset (document image layout dataset)
☆30 · Mar 2, 2023 · Updated 3 years ago
NISTEP / minutes
Meeting minutes metadata dataset
☆12 · Jun 10, 2018 · Updated 8 years ago
pixiv / .editorconfig
☆16 · May 21, 2019 · Updated 7 years ago
PenguinCabinet / mama-katu-DM-corpus
A corpus of Japanese spam messages inviting recipients to Mama Katu.
☆42 · Aug 1, 2025 · Updated 11 months ago
Takeuchi-Lab-LM / python_asa
Python version of the Japanese semantic role labeling system (ASA)
☆22 · Nov 11, 2022 · Updated 3 years ago
oatsu-gh / enunu_kodoku_singing
A singing database created by 22 people each singing five children's songs.
☆15 · Aug 7, 2022 · Updated 3 years ago
dan-wells / fastpitch
NVIDIA's FastPitch, extracted from the DeepLearningExamples repository
☆14 · Mar 29, 2024 · Updated 2 years ago
aike / audiolang
Audio Language Examples
☆20 · Dec 26, 2020 · Updated 5 years ago
singletongue / wikipedia-utils
Utility scripts for preprocessing Wikipedia texts for NLP
☆78 · Apr 9, 2024 · Updated 2 years ago
youichiro / transformer-copy
Japanese grammatical error correction tool
☆29 · Jun 22, 2022 · Updated 4 years ago
ikawaha / kagome-dict
Dictionary Library for Kagome v2
☆15 · Jul 9, 2026 · Updated 2 weeks ago
polm / deltos
A magic notepad. δ
☆14 · May 21, 2023 · Updated 3 years ago
KoichiYasuoka / UniDic2UD
Tokenizer, POS-tagger, lemmatizer, and dependency-parser for modern and contemporary Japanese
☆38 · Dec 29, 2025 · Updated 7 months ago
cbottrell / HPC_IPMU
High-performance computing using Kavli IPMU clusters (specifically idark).
☆13 · Apr 20, 2023 · Updated 3 years ago
6gsn / marine
☆38 · Sep 20, 2022 · Updated 3 years ago
neologd / mecab-unidic-neologd
Neologism dictionary based on the language resources on the Web for mecab-unidic
☆88 · Sep 14, 2020 · Updated 5 years ago
tanreinama / gpt2-japanese
Japanese GPT2 Generation Model
☆323 · Sep 2, 2023 · Updated 2 years ago