tsmatz / huggingface-finetune-japanese
Examples to finetune encoder-only and encoder-decoder transformers for Japanese language in Hugging Face (Oct 2022)
☆15Updated last year
Alternatives and similar repositories for huggingface-finetune-japanese:
Users that are interested in huggingface-finetune-japanese are comparing it to the libraries listed below
- Japanese LLaMa experiment☆53Updated 3 months ago
- ☆42Updated last year
- Pre-training Language Models for Japanese☆49Updated last year
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆37Updated 2 years ago
- MAFAND-MT☆55Updated 8 months ago
- A series of notebooks demonstrating how to build simple NLP web apps with Gradio and Hugging Face transformers☆45Updated 3 years ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆41Updated last year
- An environment where you can try out faster-whisper immediately.☆37Updated 4 months ago
- Paraphrasing for academic texts☆14Updated 2 years ago
- Open source RAG with Llama Index for Japanese LLM in low resource settting☆8Updated last year
- This project aims at creating a search engine based on BERT language model.☆19Updated 4 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆87Updated last year
- Japanese / English Bilingual LLM☆11Updated 3 months ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆76Updated 11 months ago
- LLM構築用の日本語チャットデータセット☆81Updated last year
- Awesome Question Answering☆28Updated 2 years ago
- Abstractive and Extractive Text summarization using Transformers.☆82Updated last year
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆42Updated 3 years ago
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way☆58Updated 2 years ago
- Annotation meets Large Language Models (ChatGPT, GPT-3 and alike).☆56Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- Theoretical introduction for language processing terminologies (such as, embedding, encoder/decoder, attention, transformer, ...) and com…☆27Updated last month
- A simple implementation of SimCSE☆76Updated 2 years ago
- YAST - Yet Another SPLADE or Sparse Trainer☆16Updated last month
- COMET-ATOMIC ja☆29Updated last year
- FRAKE: Fusional Real-time Automatic Keyword Extraction☆21Updated last year
- Streamlit Named Entity Recognition (NER) annotation custom component☆38Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆71Updated last year
- ☆21Updated 4 years ago