tsmatz / huggingface-finetune-japaneseLinks
Examples to finetune encoder-only and encoder-decoder transformers for Japanese language in Hugging Face (Oct 2022)
☆16Updated last year
Alternatives and similar repositories for huggingface-finetune-japanese
Users that are interested in huggingface-finetune-japanese are comparing it to the libraries listed below
Sorting:
- ☆41Updated last year
- Annotation meets Large Language Models (ChatGPT, GPT-3 and alike).☆56Updated 2 years ago
- Support Continual pre-training & Instruction Tuning forked from llama-recipes☆32Updated last year
- Pre-training Language Models for Japanese☆49Updated last year
- MobileBERT and DistilBERT for extractive summarization☆89Updated last year
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆42Updated 4 years ago
- Streamlit Named Entity Recognition (NER) annotation custom component☆38Updated 2 years ago
- Comparing M2M and mT5 on a rare language pairs, blog post: https://medium.com/@abdessalemboukil/comparing-facebooks-m2m-to-mt5-in-low-re…☆15Updated 3 years ago
- Awesome Question Answering☆29Updated 2 years ago
- Web App Capable of Predicting Next Word Using BERT☆14Updated 2 years ago
- Japanese LLaMa experiment☆52Updated 6 months ago
- Abstractive and Extractive Text summarization using Transformers.☆83Updated last year
- COMET-ATOMIC ja☆29Updated last year
- Utility scripts for preprocessing Wikipedia texts for NLP☆77Updated last year
- ☆41Updated last year
- A series of notebooks demonstrating how to build simple NLP web apps with Gradio and Hugging Face transformers☆45Updated 3 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆50Updated last month
- You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…☆17Updated 4 years ago
- Paraphrasing for academic texts☆14Updated 2 years ago
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆39Updated 2 weeks ago
- LLM構築用の日本語チャットデータセット☆83Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 10 months ago
- A soft and fast pattern matcher for billion-scale corpora.☆56Updated 3 months ago
- doccano auto labeling pipeline helps doccano to annotate a document automatically.☆42Updated last year
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda☆13Updated last month
- YAST - Yet Another SPLADE or Sparse Trainer☆18Updated 2 weeks ago
- Implementation of ECIR 2022 Paper: How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generat…☆15Updated 2 years ago
- Simply, faster, sentence-transformers☆142Updated 9 months ago
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!☆34Updated 4 years ago