☆44Feb 2, 2024Updated 2 years ago
Alternatives and similar repositories for llm-jp-corpus
Users that are interested in llm-jp-corpus are comparing it to the libraries listed below.
- 2023 ABCI Llama-2 continual learning project☆14Jan 22, 2024Updated 2 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆124Nov 13, 2025Updated 4 months ago
- ☆57Jun 17, 2024Updated last year
- ☆62Jun 13, 2024Updated last year
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 9 months ago
- Japanese LLaMa experiment☆54Dec 27, 2025Updated 3 months ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆20Jan 13, 2025Updated last year
- The speech synthesis engine of VOICEVOX, a free, medium-quality text-to-speech software☆10Jan 30, 2023Updated 3 years ago
- Repository for JSICK☆45May 31, 2023Updated 2 years ago
- ☆19May 23, 2024Updated last year
- Software that lets you easily chat with a local LLM from a web browser.☆27Dec 28, 2023Updated 2 years ago
- ☆46Updated this week
- Japanese chat dataset for building LLMs☆88Jan 23, 2024Updated 2 years ago
- TIFMO: Textual Inference Forward-chaining MOdule☆12Apr 25, 2014Updated 11 years ago
- ☆10May 16, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆154Sep 13, 2024Updated last year
- ☆16Mar 4, 2024Updated 2 years ago
- JQaRA: Japanese Question Answering with Retrieval Augmentation - a Japanese Q&A dataset for evaluating retrieval augmentation (RAG)☆43Sep 9, 2025Updated 6 months ago
- Japanese Massive Multitask Language Understanding Benchmark☆38Oct 7, 2025Updated 5 months ago
- ☆11Jun 19, 2022Updated 3 years ago
- JGLUE: Japanese General Language Understanding Evaluation☆337Mar 31, 2025Updated 11 months ago
- ☆27Nov 4, 2024Updated last year
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- Scripts for creating a Japanese-English parallel corpus and training NMT models☆18Nov 9, 2021Updated 4 years ago
- Python scripts for AI voice changers☆14Apr 25, 2023Updated 2 years ago
- Exploring Japanese SimCSE☆69Oct 31, 2023Updated 2 years ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆88Mar 16, 2026Updated last week
- Support site for the book "自然言語処理の教科書" (Textbook of Natural Language Processing)☆14Apr 1, 2025Updated 11 months ago
- Easy-to-use scripts to fine-tune GPT-2-JA with your own texts, to generate sentences, and to tweet them automatically.☆19Aug 26, 2025Updated 7 months ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- Support Continual pre-training & Instruction Tuning forked from llama-recipes☆34Feb 17, 2024Updated 2 years ago
- Kyoto University Web Document Leads Corpus☆83Dec 18, 2023Updated 2 years ago
- Python version of the Japanese semantic role labeling system (ASA)☆22Nov 11, 2022Updated 3 years ago
- Finetuning LLaMA with DeepSpeed☆10Apr 14, 2023Updated 2 years ago
- Unofficial entropix implementation for Gemma2, Llama, Qwen2, and Mistral☆17Jan 12, 2025Updated last year
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- ☆89Jul 25, 2023Updated 2 years ago
- ☆50Apr 10, 2024Updated last year
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated last year