Fine tune a T5 transformer model using PyTorch & Transformers🤗
☆219Feb 10, 2021Updated 5 years ago
Alternatives and similar repositories for T5-Finetuning-PyTorch
Users that are interested in T5-Finetuning-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pCLUE: 1000000+多任务提示学习数据集☆508Oct 4, 2022Updated 3 years ago
- ☆11Jul 11, 2023Updated 2 years ago
- Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.☆32Apr 26, 2021Updated 4 years ago
- ☆88Aug 25, 2023Updated 2 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- GOAT(山羊)是中英文大语言模型,基于LlaMa进行SFT。☆12Apr 24, 2023Updated 2 years ago
- clue chatyuan finetuning☆17Mar 10, 2025Updated last year
- ☆20Jan 18, 2022Updated 4 years ago
- ☆11Jul 28, 2021Updated 4 years ago
- Modified version of T5-DST for Dialogue State Tracking.☆19Dec 10, 2021Updated 4 years ago
- INSET: Sentence Infilling with Inter-sentential Transformer☆30Nov 21, 2020Updated 5 years ago
- Text perturbation methods to evaluate the robustness of NLP models☆20Oct 6, 2021Updated 4 years ago
- TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLO…☆980Sep 14, 2024Updated last year
- MetaQA: Combining Expert Agents for Multi-Skill Question Answering☆23Mar 13, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago
- Hybrid List Aware Transformer Reranking☆20Oct 25, 2022Updated 3 years ago
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- Summary of Text Classification in deep learning techniques implemented by PyTorch and TensorFlow. 深度学习文本分类技术总结,以PyTorch实现。☆14Dec 18, 2019Updated 6 years ago
- A PyTorch Implementation of the EMNLP 2020 paper "Mitigating Gender Bias for Neural Dialogue Generation with Adversarial Learning"☆13Feb 20, 2021Updated 5 years ago
- Plug-and-Play Document Modules for Pre-trained Models☆25May 28, 2023Updated 2 years ago
- ☆10May 26, 2022Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆63Dec 6, 2022Updated 3 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Feb 1, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.☆180Dec 9, 2022Updated 3 years ago
- Multi-task learning of Abstractive Summarization with Entailment Generation implemented using PyTorch☆16Jun 11, 2018Updated 7 years ago
- GeneSis is the first generative approach for lexical substitution (EMNLP 2021).☆13Jul 25, 2023Updated 2 years ago
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆26Jul 13, 2025Updated 9 months ago
- ☆13Sep 9, 2020Updated 5 years ago
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Jun 15, 2023Updated 2 years ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 5 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,503Jan 14, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Data labeling using few shot learning GPT-3.☆25Mar 26, 2023Updated 3 years ago
- Accompanies Finastra's Hack to the Future 4 Learning Session "Sustainability reports & NLP"☆10Mar 17, 2022Updated 4 years ago
- A list of Numerical Multimodal reasoning papers and their implementation☆11May 13, 2024Updated last year
- ☆350Aug 8, 2021Updated 4 years ago
- In-context learning, Fine-Tuning, RLHF on Flan-T5☆13Aug 30, 2023Updated 2 years ago
- Covid Doctor chatbot using DialoGPT☆13May 25, 2022Updated 3 years ago
- 基于百度uie的关系抽取☆20Sep 26, 2022Updated 3 years ago