text-only archives of www.aozora.gr.jp
☆93Mar 22, 2023Updated 3 years ago
Alternatives and similar repositories for aozorabunko_text
Users that are interested in aozorabunko_text are comparing it to the libraries listed below.
- ☆861Updated this week
- Easy-to-use scripts to fine-tune GPT-2-JA with your own texts, to generate sentences, and to tweet them automatically.☆19Aug 26, 2025Updated 9 months ago
- Japanese tokenizer for Transformers☆79Dec 15, 2023Updated 2 years ago
- Tutorials for learning Ruby☆13May 11, 2020Updated 6 years ago
- ☆22Jan 11, 2023Updated 3 years ago
- This repository hosts the source code for "AllenNLP入門" (Introduction to AllenNLP).☆35May 1, 2023Updated 3 years ago
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- ☆15Mar 31, 2020Updated 6 years ago
- PyTorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.☆89Nov 3, 2023Updated 2 years ago
- 🐣 A mini-game to raise "Uzimaru" on GitHub contributions☆11Jul 8, 2022Updated 3 years ago
- Multimodal dataset for ad text generation in Japanese [Mita+, ACL2024]☆26Aug 13, 2024Updated last year
- Repository for JSICK☆46May 31, 2023Updated 3 years ago
- Erika Yano handing yokan to the hungry (Twitter bot implemented in Deno)☆12Jun 1, 2020Updated 6 years ago
- IPAdic packaged for easy use from Python.☆24Oct 31, 2021Updated 4 years ago
- About Me☆11Feb 24, 2025Updated last year
- PDEJS - Plugin(Preact-like) Declarative for Editor.js☆15Dec 4, 2025Updated 6 months ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated 2 years ago
- IME dictionary for the Nippon Decimal Classification☆11Dec 8, 2022Updated 3 years ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆20Jan 13, 2025Updated last year
- pygeonlp, A python module for geotagging Japanese texts.☆22Mar 24, 2026Updated 2 months ago
- Japanese GPT2 Generation Model☆324Sep 2, 2023Updated 2 years ago
- Rails app for managing a conference CFP☆14Apr 5, 2026Updated 2 months ago
- A paper list for box embeddings☆17Jun 9, 2021Updated 5 years ago
- A Japanese Morphological Analyzer written in pure Rust☆26Oct 25, 2019Updated 6 years ago
- Raw corpus and tools collecting technical books written in Japanese☆26Apr 8, 2026Updated 2 months ago
- ☆19Apr 21, 2026Updated last month
- textlint rule plugin to check duplicated conjunctive particle `ga` in a sentence.☆11Nov 26, 2023Updated 2 years ago
- You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…☆18May 2, 2021Updated 5 years ago
- OCR training dataset created in the project for converting digitized materials to text via OCR☆83Jun 26, 2024Updated last year
- Nishika Akutagawa competition, 2nd prize: https://www.nishika.com/competitions/1/summary ☆25Mar 6, 2020Updated 6 years ago
- [2024 edition] Text classification with BERT☆30Jul 8, 2024Updated last year
- Furigana corpus dataset created from Aozora Bunko and Sapie braille data☆22Jan 17, 2024Updated 2 years ago
- ☆30Apr 10, 2025Updated last year
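- Repository for issues and wiki pages concerning aozorahack in general☆192Oct 30, 2015Updated 10 years ago
- Irasutoya search command☆61Jan 29, 2016Updated 10 years ago
- textlint rule that checks for the same particle appearing multiple times in a sentence☆24May 8, 2026Updated last month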
- Japanese-BPEEncoder☆41Sep 12, 2021Updated 4 years ago
- AllenNLP integration for Shiba: Japanese CANINE model☆12Jun 26, 2021Updated 4 years ago
- ☆17May 31, 2023Updated 3 years ago