ndl-lab/ndlngramdata

ndl-lab / ndlngramdata

A dataset of n-gram frequency statistics for OCR text data created from digitized materials

☆17

Alternatives and similar repositories for ndlngramdata

Users who are interested in ndlngramdata are comparing it to the repositories listed below.

TEI-EAJ / jp_guidelines
Explains, in Japanese, how to conform to the TEI Guidelines.
☆12 · Feb 15, 2021 · Updated 5 years ago
ndl-lab / pdmocrdataset-part2
OCR training dataset created in the OCR Processing Program Research and Development Project
☆15 · Jun 26, 2024 · Updated 2 years ago
ndl-lab / hiragana_mojigazo
Character image dataset (73-character hiragana version)
☆18 · Apr 6, 2020 · Updated 6 years ago
yuta1984 / honkoku-data
Text data repository for Minna de Honkoku, a citizen-participation transcription platform for historical materials. / Transcription texts created on Minna de Honkoku (https://honkoku.org), a crowdsourced transc…
☆21 · Updated this week
nandenjin / itfdic
A localized word dictionary asset for the University of Tsukuba
☆12 · Sep 19, 2025 · Updated 10 months ago
takahashim / trc_opendata
Unofficial archive of TRC new book open data
☆13 · Nov 30, 2017 · Updated 8 years ago
ndl-lab / pdmocrdataset-part1
OCR training dataset created in the Digitized Materials OCR Text Conversion Project
☆83 · Jun 26, 2024 · Updated 2 years ago
FIWARE / tutorials.Identity-Management
FIWARE 401: IDM - Managing Users and Organizations
☆10 · May 15, 2026 · Updated 2 months ago
sile / hone
A shell-friendly hyperparameter search tool inspired by Optuna
☆18 · Dec 17, 2024 · Updated last year
performant-software / neatline-omeka-s
A module for Omeka S that provides an API for the Neatline 3 single page application
☆18 · Mar 26, 2023 · Updated 3 years ago
mlpnlp / mlpnlp
Machine Learning Professional Series: Natural Language Processing with Deep Learning (深層学習による自然言語処理)
☆36 · Jun 15, 2023 · Updated 3 years ago
shunk031 / human-attention-map-for-text-classification
Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2…
☆17 · Jul 10, 2020 · Updated 6 years ago
ku-nlp / AnnotatedFKCCorpus
Annotated Fuman Kaitori Center Corpus
☆18 · Dec 18, 2023 · Updated 2 years ago
recogito / recogito-client-core
Core functions and components for RecogitoJS and Annotorious
☆16 · Nov 9, 2023 · Updated 2 years ago
passaglia / yomikata
Disambiguate Japanese heteronyms
☆34 · Oct 3, 2023 · Updated 2 years ago
ndc-dev / ndc-ime-dic
IME dictionary for the Nippon Decimal Classification (NDC)
☆11 · Dec 8, 2022 · Updated 3 years ago
megagonlabs / ginza-transformers
Use custom tokenizers in spacy-transformers
☆16 · Aug 9, 2022 · Updated 3 years ago
megagonlabs / ebe-dataset
Evidence-based Explanation Dataset (AACL-IJCNLP 2020)
☆18 · Dec 17, 2020 · Updated 5 years ago
sj-doyle / NGSI-LD-Entities
These are a set of data definitions for harmonising the data from IoT and related context data sources. They have been developed through …
☆16 · Oct 3, 2019 · Updated 6 years ago
victoprincipe / Unity-3D-Magnus-Effect
Magnus Effect Simulation
☆10 · Aug 6, 2018 · Updated 7 years ago
octanove / shiba
PyTorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.
☆89 · Nov 3, 2023 · Updated 2 years ago
calmery / vrchat
Unofficial VRChat API Client 🤫
☆11 · Aug 12, 2021 · Updated 4 years ago
assemblerbot / pico8
Pico8 games and tools
☆13 · Nov 7, 2022 · Updated 3 years ago
CALIL / openbookcamera
Open Book Camera, a fast system for photographing book covers
☆23 · May 21, 2026 · Updated 2 months ago
makora9143 / deterministic-variational-inference-pytorch
☆13 · Mar 16, 2019 · Updated 7 years ago
musyoku / unsupervised-pos-tagging
Unsupervised part-of-speech tagging
☆16 · Mar 22, 2018 · Updated 8 years ago
ndl-lab / ndlkotenocr_cli
NDL Kotenseki OCR (OCR for pre-modern Japanese books) application, including source code
☆98 · Oct 14, 2025 · Updated 9 months ago
tshimada291 / gtfs-jp-list-datecheck
Date check for GTFS/GTFS-JP fixed-URL data
☆11 · Updated this week
sotokisehiro / chrome-llm-sample
Sample code for chatting with Google Chrome's built-in local LLM.
☆13 · Jan 15, 2025 · Updated last year
Cekay3D / UdonWaterInteractions
Make your VRChat world's water more immersive with sounds and particle systems that provide feedback for your hands, head, and body.
☆10 · Oct 6, 2023 · Updated 2 years ago
Cyrusky / hexo-backlink
This plugin converts Obsidian-style backlinks into standard Hexo in-site post links.
☆19 · Feb 22, 2024 · Updated 2 years ago
ndl-lab / ndl-minhon-ocrdataset
NDL Kotenseki OCR training dataset (data processed from Minna de Honkoku)
☆22 · Mar 13, 2026 · Updated 4 months ago
himkt / interest
👀 Interest: Organizing papers+materials which you are interested in. Serverless application powered by GitHub pages + Google Spreadshee…
☆16 · Jan 7, 2023 · Updated 3 years ago
Codel1417 / VRC-Looking-Glass
Displays VRChat avatars from your download cache on a Looking Glass display.
☆14 · Mar 21, 2022 · Updated 4 years ago
musyoku / gqn-dataset-renderer
☆26 · Jul 19, 2019 · Updated 7 years ago
nagai-takayuki / Android
☆14 · Jan 11, 2013 · Updated 13 years ago
ndl-lab / ndlocr_cli
Repository for the NDLOCR application (including source code)
☆678 · Jan 5, 2026 · Updated 6 months ago
TEI-EAJ / aozora_tei
A project to make Aozora Bunko texts more useful (by improving machine readability)
☆26 · Jun 23, 2026 · Updated last month
ikegami-yukino / sengiri
Yet another sentence-level tokenizer for Japanese text
☆24 · Nov 27, 2025 · Updated 7 months ago