ce-lery/japanese-mistral-300m-recipe

ce-lery / japanese-mistral-300m-recipe

☆19

Alternatives and similar repositories for japanese-mistral-300m-recipe

Users who are interested in japanese-mistral-300m-recipe are comparing it to the libraries listed below.

KanHatakeyama / JapaneseWarcParser
☆16 • Mar 4, 2024 • Updated 2 years ago
lighttransport / japanese-llama-experiment
Japanese LLaMa experiment
☆54 • Dec 27, 2025 • Updated 7 months ago
kunishou / do-not-answer-ja
☆24 • Dec 15, 2023 • Updated 2 years ago
johnknash2025 / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆24 • Mar 19, 2023 • Updated 3 years ago
sonoisa / clip-japanese
Japanese CLIP model
☆13 • Sep 15, 2025 • Updated 10 months ago
iwiwi / epochraft-hf-fsdp
Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP
☆11 • Jan 29, 2024 • Updated 2 years ago
nu-dialogue / real-persona-chat
RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities
☆66 • Mar 13, 2024 • Updated 2 years ago
junya-takayama / DIRECT
DIRECT: Direct and Indirect REsponses in Conversational Text Corpus
☆17 • Jul 1, 2021 • Updated 5 years ago
Ino-Ichan / GIT-LLM
☆21 • Sep 18, 2023 • Updated 2 years ago
kunishou / oasst1-89k-ja
☆16 • Nov 19, 2023 • Updated 2 years ago
kotoba-tech / kotoba-recipes
Supports continual pre-training and instruction tuning; forked from llama-recipes
☆34 • Feb 17, 2024 • Updated 2 years ago
kenoharada / labudy
☆19 • Nov 12, 2025 • Updated 8 months ago
softmatcha / softmatcha
A soft and fast pattern matcher for billion-scale corpora.
☆75 • Feb 26, 2025 • Updated last year
MorenoLaQuatra / bart-it
Pre-training BART model for the Italian Language
☆16 • Dec 28, 2022 • Updated 3 years ago
leia-llm / leia
LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
☆23 • Apr 24, 2024 • Updated 2 years ago
UCLabNU / OpenUAS
☆15 • Jun 17, 2025 • Updated last year
pfnet-research / pfgen-bench
Preferred Generation Benchmark
☆103 • Mar 6, 2026 • Updated 4 months ago
llm-jp / llm-jp-tokenizer
☆48 • Mar 30, 2026 • Updated 3 months ago
cnamejj / PyProc
Linux /proc data in a consistent, parsed format.
☆10 • Mar 28, 2016 • Updated 10 years ago
frodo821 / BitNet-Transformers
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…
☆98 • Mar 1, 2024 • Updated 2 years ago
shimo-lab / Universal-Geometry-with-ICA
Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)
☆22 • Jun 17, 2025 • Updated last year
usm-takl / tiny-lang-with-lsp
☆12 • Dec 22, 2020 • Updated 5 years ago
Hajime-Y / reasoning-model
☆49 • Dec 18, 2024 • Updated last year
hppRC / llm-translator
Mixtral-based Ja-En (En-Ja) Translation model
☆20 • Jan 6, 2025 • Updated last year
turingmotors / vlm-recipes
☆20 • Aug 28, 2024 • Updated last year
zubayerhimel / kanban-with-tailwindcss
Kanban board made with TailwindCSS
☆11 • Jun 10, 2021 • Updated 5 years ago
U-C4N / Deepseek-CoT
Deepseek-CoT
☆10 • Oct 6, 2024 • Updated last year
kunishou / databricks-dolly-15k-ja
☆89 • Jul 25, 2023 • Updated 3 years ago
AkariGroup / akari_chatgpt_bot
A chatbot app that converses using speech recognition, text generation, and speech synthesis
☆48 • Oct 16, 2025 • Updated 9 months ago
m13253 / midi-track-merge
Merge multi-track MIDI sequence into a single track for further processing
☆12 • Nov 4, 2020 • Updated 5 years ago
okoge-kaz / llm-recipes
Ongoing research project for continual pre-training of LLMs (dense mode)
☆45 • Mar 3, 2025 • Updated last year
lighttransport / jagger-python
Python binding for Jagger (C++ implementation of a pattern-based Japanese morphological analyzer)
☆13 • Dec 16, 2025 • Updated 7 months ago
anlp-nenji / nlproceedings
LaTeX document class for the proceedings of ANLP
☆21 • Oct 28, 2025 • Updated 9 months ago
mizuumi / JDocQA
☆44 • Apr 10, 2025 • Updated last year
projectlucas / efficient_whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆19 • Dec 1, 2022 • Updated 3 years ago
ikegami-yukino / dataset-list
Lists of text corpora and more (mainly Japanese)
☆119 • Jul 25, 2024 • Updated 2 years ago
Faildes / Universal-Model-Merge-Scripter
Creates a CMM script that can be directly executed on Kaggle from an easy merge script
☆14 • Mar 6, 2026 • Updated 4 months ago
gepuro / ai_agent_company_research
☆15 • Jan 26, 2025 • Updated last year
masanorihirano / llm-japanese-dataset
Japanese chat dataset for building LLMs
☆88 • Jan 23, 2024 • Updated 2 years ago