☆43Feb 2, 2024Updated 2 years ago
Alternatives and similar repositories for llm-jp-corpus
Users that are interested in llm-jp-corpus are comparing it to the libraries listed below
Sorting:
- 2023 ABCI Llama-2 継続学習プロジェクト☆14Jan 22, 2024Updated 2 years ago
- ☆62Jun 13, 2024Updated last year
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆125Nov 13, 2025Updated 3 months ago
- ☆57Jun 17, 2024Updated last year
- 無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声 合成エンジン☆10Jan 30, 2023Updated 3 years ago
- Webブラウザから手軽にローカルLLMとおしゃべりできるソフトウェアです。☆27Dec 28, 2023Updated 2 years ago
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 8 months ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆18Jan 13, 2025Updated last year
- Repository for JSICK☆45May 31, 2023Updated 2 years ago
- ☆19May 23, 2024Updated last year
- Japanese LLaMa experiment☆54Dec 27, 2025Updated 2 months ago
- JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット☆43Sep 9, 2025Updated 6 months ago
- ☆16Mar 4, 2024Updated 2 years ago
- LLM構築用の日本語チャットデータセット☆88Jan 23, 2024Updated 2 years ago
- ☆27Nov 4, 2024Updated last year
- ☆11Jun 19, 2022Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.☆153Sep 13, 2024Updated last year
- Exploring Japanese SimCSE☆69Oct 31, 2023Updated 2 years ago
- 【2024年版】BERTによるテキスト分類☆30Jul 8, 2024Updated last year
- Scripts for creating a Japanese-English parallel corpus and training NMT models☆18Nov 9, 2021Updated 4 years ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆87Updated this week
- This is a repository for comparing voice changer results and searching datasets and trained models.☆30May 21, 2023Updated 2 years ago
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆38Oct 7, 2025Updated 5 months ago
- ☆17May 31, 2023Updated 2 years ago
- JGLUE: Japanese General Language Understanding Evaluation☆337Mar 31, 2025Updated 11 months ago
- A library for semantic similarity search☆26Jan 31, 2025Updated last year
- Easy-to-use scripts to fine-tune GPT-2-JA with your own texts, to generate sentences, and to tweet them automatically.☆19Aug 26, 2025Updated 6 months ago
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 5 months ago
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities☆63Mar 13, 2024Updated last year
- Kyoto University Web Document Leads Corpus☆83Dec 18, 2023Updated 2 years ago
- python版日本語意味役割付与システム(ASA)☆22Nov 11, 2022Updated 3 years ago
- Precise Anime face detection☆24May 26, 2024Updated last year
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- TSファイルからXMLTV形式の番組表を作成する☆11May 11, 2014Updated 11 years ago
- TIFMO: Textual Inference Forward-chaining MOdule☆12Apr 25, 2014Updated 11 years ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- ☆10May 16, 2024Updated last year