llm-jp / awesome-japanese-llm
日本語LLMまとめ - Overview of Japanese LLMs
☆1,147Updated 2 weeks ago
Alternatives and similar repositories for awesome-japanese-llm:
Users that are interested in awesome-japanese-llm are comparing it to the libraries listed below
- A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese☆805Updated this week
- JGLUE: Japanese General Language Understanding Evaluation☆313Updated 3 weeks ago
- Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.☆734Updated last week
- Code for producing Japanese pretrained models provided by rinna Co., Ltd.☆582Updated 2 years ago
- BERT models for Japanese text.☆533Updated last year
- 「大規模言語モデル入門」(2023)と「大規模言語モデル入門Ⅱ〜生成型LLMの実装と評価」(2024)のGitHubリポジトリ☆397Updated 3 months ago
- NDLOCRアプリケーションのリポジトリ(ソースコードを含む)☆541Updated 2 months ago
- 【2023年版】BERTによるテキスト分類☆233Updated 11 months ago
- ☆817Updated this week
- ☆176Updated 6 months ago
- A Japanese NLP Library using spaCy as framework based on Universal Dependencies☆789Updated last year
- mecab-python. you can find original version here//taku910.github.io/mecab/☆555Updated 5 months ago
- Japanese GPT2 Generation Model☆317Updated last year
- A lexicon for Sudachi☆249Updated 2 months ago
- J-Moshi: A Japanese Full-duplex Spoken Dialogue System☆234Updated 2 months ago
- Python version of Sudachi, a Japanese tokenizer.☆405Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆151Updated 7 months ago
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.☆443Updated 3 months ago
- Japanese morphological analysis engine written in pure Python☆871Updated 2 months ago
- 日本語OCR☆234Updated 3 years ago
- This repository is archived! The maintained MeCab can be found https://github.com/shogo82148/mecab☆256Updated 6 months ago
- A Japanese Tokenizer for Business☆842Updated 3 months ago
- Japanese text normalizer for mecab-neologd☆279Updated last month
- ☆129Updated 2 weeks ago
- ☆295Updated 11 months ago
- Japanese word embedding with Sudachi and NWJC 🌿☆163Updated last year
- A Japanese tokenizer based on recurrent neural networks☆399Updated 10 months ago
- コードで学ぶAWS入門☆409Updated 2 years ago
- Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.☆930Updated 3 weeks ago
- ☆136Updated 11 months ago