デジタル化資料OCRテキスト化事業において作成されたOCR学習用データセット
☆83Jun 26, 2024Updated last year
Alternatives and similar repositories for pdmocrdataset-part1
Users that are interested in pdmocrdataset-part1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NDL-DocLデータセット(資料画像レイアウトデータセット)☆30Mar 2, 2023Updated 3 years ago
- NDLOCRアプリケーションのリポジトリ(ソースコードを含む)☆675Jan 5, 2026Updated 5 months ago
- NDL古典籍OCR学習用データセット(みんなで翻刻加工データ)☆20Mar 13, 2026Updated 3 months ago
- 文字画像データセット(平仮名73文字版)☆18Apr 6, 2020Updated 6 years ago
- デジタル化資料から作成したOCRテキストデータのngram頻度統計情報のデータセット☆17Jan 10, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- OCR処理プログラム研究開発事業において作成されたOCR学習用データセット☆15Jun 26, 2024Updated last year
- ☆30Apr 10, 2025Updated last year
- ☆19Feb 9, 2025Updated last year
- 次世代デジタルライブラリーのソースコード(Programs of the Next Digital Library.)☆26Apr 30, 2026Updated last month
- Show notes for https://anchor.fm/yoheikikuta.☆15Apr 24, 2022Updated 4 years ago
- Google Chromeの内蔵ローカルLLMでチャットするためのサンプルコードです。☆13Jan 15, 2025Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆24Mar 19, 2023Updated 3 years ago
- Ono laboratory audio signal processing exercise for beginners.☆19May 10, 2023Updated 3 years ago
- NLP2025 のチュートリアル「地理情報と言語処理 実践入門」の資料とソースコード☆17Updated this week
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 解析が難しい日本の住所のテストデータセット☆14Sep 25, 2023Updated 2 years ago
- NDL古典籍OCRのアプリケーション(ソースコードを含む)☆97Oct 14, 2025Updated 8 months ago
- SATySFi commands and DSL for displaying derivation trees with maintainable code☆11Jan 2, 2021Updated 5 years ago
- ☆19Mar 12, 2026Updated 3 months ago
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities☆64Mar 13, 2024Updated 2 years ago
- Japanese BERT Pretrained Model☆23Nov 13, 2021Updated 4 years ago
- 鴨川って快活CLUBだ☆16Jan 24, 2023Updated 3 years ago
- text-only archives of www.aozora.gr.jp☆93Mar 22, 2023Updated 3 years ago
- Mecab + NEologd + Docker + Python3☆36May 10, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆45Jun 2, 2026Updated 2 weeks ago
- Wikipediaを用いた日本語の固有表現抽出データセット☆143Sep 2, 2023Updated 2 years ago
- ☆22Sep 18, 2023Updated 2 years ago
- 進捗大陸で使用されたSATySFiファイル☆12May 22, 2023Updated 3 years ago
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer☆289Feb 7, 2026Updated 4 months ago
- General-purpose Swich transformer based Japanese language model☆118Sep 13, 2023Updated 2 years ago
- Unofficial browser extension for Scrapbox☆30Jul 31, 2022Updated 3 years ago
- Japanese-BPEEncoder☆41Sep 12, 2021Updated 4 years ago
- ☆16Nov 19, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Nov 30, 2023Updated 2 years ago
- RDF data for Knowledge Graph Reasoning Challenge.☆21Feb 28, 2025Updated last year
- Easily turn large English text datasets into Japanese text datasets using open LLMs.☆29Jan 20, 2025Updated last year
- ☆30Updated this week
- ☆12Dec 12, 2019Updated 6 years ago
- Pre-train Embedding in LightFM Recommender System Framework☆11Apr 28, 2019Updated 7 years ago
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese☆38Dec 29, 2025Updated 5 months ago