Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.
☆32Apr 26, 2021Updated 4 years ago
Alternatives and similar repositories for Pretraining-T5-PyTorch-Lightning
Users that are interested in Pretraining-T5-PyTorch-Lightning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints☆38Mar 21, 2021Updated 5 years ago
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models☆13Nov 2, 2023Updated 2 years ago
- ☆13Jun 19, 2021Updated 4 years ago
- ☆23Feb 6, 2022Updated 4 years ago
- ☆45Sep 12, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13Oct 21, 2021Updated 4 years ago
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 6 months ago
- Generating artificial disfluencies from fluent text easily and promptly☆15Sep 28, 2022Updated 3 years ago
- 基于Bert实现中文文本二分类☆29Mar 2, 2020Updated 6 years ago
- The official repository for Dynamic Clustering and Cluster Contrastive Learning (DCCC).☆14Dec 15, 2023Updated 2 years ago
- Seq2seq using LSTM with attention from Luong et al☆10Oct 2, 2018Updated 7 years ago
- 诗云APP,一款寓教于乐的古诗词普及APP。☆11Oct 3, 2022Updated 3 years ago
- 机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格…☆10Feb 25, 2020Updated 6 years ago
- ☆34Oct 30, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020.☆29Jun 28, 2020Updated 5 years ago
- Winning solution for the Kaggle Feedback Prize Challenge.☆66Sep 5, 2022Updated 3 years ago
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 3 years ago
- iMap4 - Spatial mapping of eye movement data (e.g., fixation map) using Linear Mixed Models☆12May 29, 2018Updated 7 years ago
- A Python Terminal script for displaying Corporate filings on BSE exchange.☆19Feb 28, 2024Updated 2 years ago
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA☆43Sep 20, 2023Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of Self-Governing Neural Networks for speech act classification☆12Nov 5, 2025Updated 4 months ago
- Starter template for LLM chat interface WITH text streaming☆12Mar 5, 2024Updated 2 years ago
- ☆10Mar 29, 2022Updated 3 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆62Jan 22, 2022Updated 4 years ago
- CalorieCaptorGlass : Food Calorie Estimation based on Actual Size using HoloLens and Deep Learning (IEEE VR 2020 Demo)☆13Aug 11, 2021Updated 4 years ago
- rule matcher (context free grammar)☆10Dec 27, 2019Updated 6 years ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆50Mar 15, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Implementation of "RankCSE: Unsupervised Sentence Representation Learning via Learning to Rank" (ACL 2023)☆48Mar 12, 2024Updated 2 years ago
- ☆26Aug 14, 2022Updated 3 years ago
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- ☆13May 7, 2023Updated 2 years ago
- Google DeepMind: Mixture of Depths Unofficial Implementation.☆12May 29, 2024Updated last year
- Video-Language Alignment via Spatio–Temporal Graph Transformer; ArXiv: https://arxiv.org/abs/2407.11677☆14Jul 24, 2024Updated last year
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 5 years ago