This repository demonstrates how to fine-tune the Google Gemma 2 2B model to improve its performance on Japanese instruction-following tasks. It serves as a practical guide for developers and researchers interested in adapting large language models for specific languages or domains using state-of-the-art techniques in 2024.
☆12Aug 11, 2024Updated last year
Alternatives and similar repositories for gemma2_2b_finetune_jp_tutorial
Users that are interested in gemma2_2b_finetune_jp_tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Aug 3, 2023Updated 2 years ago
- Lion - EvoLved Sign Momentum w/ New Optimizer API in TensorFlow 2.11+☆10Feb 16, 2023Updated 3 years ago
- Wikipediaから作成した日本語名寄せデータセット☆35Mar 10, 2020Updated 6 years ago
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP).☆10Mar 5, 2022Updated 4 years ago
- 2023 Capstone Design☆12Nov 2, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AutoEncoder vs Metric Learning for Anomaly Detection☆27Jan 8, 2020Updated 6 years ago
- A TensorFlow implementation of the Continous Wavelet Transform based on the complex Morlet wavelet.☆14Aug 26, 2021Updated 4 years ago
- Teach a computer to play any game.☆10Updated this week
- Basic entity linker for the SNOMED EL Challenge☆14Jan 22, 2024Updated 2 years ago
- This released code is for our ACL2018 paper "End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions". …☆15May 28, 2018Updated 7 years ago
- OpenAi gym environment for the Rubik's Cube (3x3x3).☆14Sep 1, 2022Updated 3 years ago
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 7 years ago
- The Seismo-Performer: A Novel Machine Learning Approach for General and Efficient Seismic Phase Recognition from Local Earthquakes in Rea…☆12Apr 25, 2022Updated 3 years ago
- Resources for grounding protein families and complexes from text and describing their hierarchical relationships.☆18Mar 26, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆30Mar 9, 2021Updated 5 years ago
- ☆13Dec 21, 2021Updated 4 years ago
- ☆18Dec 10, 2022Updated 3 years ago
- For sequence-to-sequence beginners. PyTorch-implemented 1DCNN, LSTM, Attention, and Transformers.☆21Aug 3, 2025Updated 8 months ago
- ☆12Mar 20, 2020Updated 6 years ago
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.☆12Aug 27, 2023Updated 2 years ago
- Code used to create the Linked WikiText-2 dataset☆16May 22, 2023Updated 2 years ago
- ☆10Jun 16, 2021Updated 4 years ago
- Wikipedia article dataset☆12May 10, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Demo of Machine Learning Prediction Model API with Django REST API Framework☆10Dec 22, 2019Updated 6 years ago
- Predict protein thermostability with ML☆20Aug 20, 2024Updated last year
- 🍣Transfer image style 🍣☆11Jun 2, 2017Updated 8 years ago
- Japanese Entity Linker.☆12Jul 25, 2021Updated 4 years ago
- ☆17Jun 24, 2021Updated 4 years ago
- End-to-End Learning from Complex Multigraphs with Latent-Graph Convolutional Networks☆15Jul 25, 2024Updated last year
- OpenMMLab Detection Toolbox and Benchmark☆19Feb 21, 2022Updated 4 years ago
- A comprehensive NLP preprocessing package for clinical notes sentence boundary detection, tokenization☆32May 22, 2024Updated last year
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A PyTorch implementation of "Generating Sentences from a Continuous Space"☆13Feb 22, 2018Updated 8 years ago
- pytorch model for cross-lingual entity linking.☆16Mar 13, 2019Updated 7 years ago
- A simple ElasticSearch plugin wrapping around the search endpoint to provide Rocchio query expansion☆17Aug 5, 2017Updated 8 years ago
- ☆14Mar 19, 2026Updated last month
- ☆11May 12, 2019Updated 6 years ago
- 日本語テキストに対する wikification のためのソフトウェア☆17Mar 14, 2017Updated 9 years ago
- ☆11Dec 10, 2021Updated 4 years ago