manueldeprada/Pretraining-T5-PyTorch-Lightning

Collection of scripts to pretrain T5 on unlabeled text using PyTorch Lightning. CORD-19 pretraining is provided as an example.

☆32

Alternatives and similar repositories for Pretraining-T5-PyTorch-Lightning

Users who are interested in Pretraining-T5-PyTorch-Lightning are comparing it to the repositories listed below.

joeljang / Pretraining_T5_custom_dataset
Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints
☆38 · Mar 21, 2021 · Updated 5 years ago
hccngu / DialCoT
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
☆13 · Nov 2, 2023 · Updated 2 years ago
ceshine / finetuning-t5
☆23 · Feb 6, 2022 · Updated 4 years ago
qhduan / mt5-soft-prompt-tuning
☆45 · Sep 12, 2021 · Updated 4 years ago
MingjieWang0606 / 2021-Sohu-Text-Matching-TOP2
☆13 · Jun 19, 2021 · Updated 5 years ago
JD-AI-Research-Silicon-Valley / auxiliary-task-for-text-to-sql
☆13 · Oct 21, 2021 · Updated 4 years ago
alexrs / herd
Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
☆11 · Feb 11, 2024 · Updated 2 years ago
tatianapassali / artificial-disfluency-generation
Generating artificial disfluencies from fluent text easily and promptly
☆16 · Sep 28, 2022 · Updated 3 years ago
iliaschalkidis / flash-roberta
Hugging Face RoBERTa with Flash Attention 2
☆24 · Sep 14, 2025 · Updated 10 months ago
StefanHeng / ProgGen
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
☆17 · Mar 29, 2024 · Updated 2 years ago
SamLynnEvans / LSTM_with_attention
Seq2seq using LSTM with attention from Luong et al
☆10 · Oct 2, 2018 · Updated 7 years ago
theziqi / DCCC
The official repository for Dynamic Clustering and Cluster Contrastive Learning (DCCC).
☆14 · Dec 15, 2023 · Updated 2 years ago
liang8qi / Chinese-Text-Classification-Based-on-Bert
Binary classification of Chinese text implemented with BERT
☆30 · Mar 2, 2020 · Updated 6 years ago
qasymjomart / ViT_recipe_for_AD
Code implementation of the empirical study paper using ViT for AD classification
☆15 · Oct 30, 2024 · Updated last year
lliai / AlphaTree-graphic-deep-neural-network
Roadmaps for machine learning, deep learning, generative adversarial networks (GAN), graph neural networks (GNN), NLP, and big data, with a large amount of source code (Python, PyTorch) to help readers absorb the fundamentals, get through interviews, and go from beginner to qualified…
☆10 · Feb 25, 2020 · Updated 6 years ago
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆15 · Jun 27, 2024 · Updated 2 years ago
jtonglet / Numerical-Hybrid-QA-Literature
A list of Numerical Multimodal reasoning papers and their implementation
☆11 · May 13, 2024 · Updated 2 years ago
iBMLab / iMap4
iMap4 - Spatial mapping of eye movement data (e.g., fixation map) using Linear Mixed Models
☆14 · May 29, 2018 · Updated 8 years ago
INK-USC / hierarchical-explanation-neural-sequence-models
Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020.
☆29 · Jun 28, 2020 · Updated 6 years ago
abhinavkashyap / domadapter
Domain Adaptation and Adapters
☆16 · Feb 28, 2023 · Updated 3 years ago
microsoft / LiST
Lite Self-Training
☆30 · Jul 25, 2023 · Updated 3 years ago
ybabakhin / kaggle-feedback-effectiveness-1st-place-solution
Winning solution for the Kaggle Feedback Prize Challenge.
☆66 · Sep 5, 2022 · Updated 3 years ago
yakuza8 / first-order-predicate-logic-theorem-prover
Autonomous Theorem Prover for First Order Predicate Logic
☆12 · Jun 29, 2020 · Updated 6 years ago
UKPLab / emnlp2021-prompt-ft-heuristics
☆10 · Sep 27, 2021 · Updated 4 years ago
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆31 · Dec 8, 2023 · Updated 2 years ago
ZhangXu0963 / VSL
The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.
☆15 · Dec 25, 2023 · Updated 2 years ago
myjlyjly / Extractive_Summarization
☆10 · Mar 29, 2022 · Updated 4 years ago
Bai-YT / AdaptiveSmoothing
Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".
☆10 · Feb 6, 2024 · Updated 2 years ago
ryansereno / vue-chat
Starter template for LLM chat interface WITH text streaming
☆13 · Mar 5, 2024 · Updated 2 years ago
earth2observe / downscaling-tools
python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis
☆10 · Nov 21, 2017 · Updated 8 years ago
swiseman / neighbor-splicing
☆11 · Jan 2, 2022 · Updated 4 years ago
monologg / EncT5
Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks
☆62 · Jan 22, 2022 · Updated 4 years ago
nenoNaninu / CalorieCaptorGlass
CalorieCaptorGlass : Food Calorie Estimation based on Actual Size using HoloLens and Deep Learning (IEEE VR 2020 Demo)
☆13 · Aug 11, 2021 · Updated 4 years ago
perceptiveshawty / RankCSE
Implementation of "RankCSE: Unsupervised Sentence Representation Learning via Learning to Rank" (ACL 2023)
☆49 · Mar 12, 2024 · Updated 2 years ago
jiefisher / matcher
rule matcher (context free grammar)
☆10 · Dec 27, 2019 · Updated 6 years ago
lxe / llama-tune
LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers
☆50 · Mar 15, 2023 · Updated 3 years ago
JoaoLages / RATransformers
RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!
☆42 · Dec 14, 2022 · Updated 3 years ago
MichaelEinhorn / trl-textworld
☆13 · May 7, 2023 · Updated 3 years ago
liyongqi67 / LTRGR
☆21 · Aug 9, 2024 · Updated last year