manueldeprada / Pretraining-T5-PyTorch-LightningView external linksLinks
Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.
☆32Apr 26, 2021Updated 4 years ago
Alternatives and similar repositories for Pretraining-T5-PyTorch-Lightning
Users that are interested in Pretraining-T5-PyTorch-Lightning are comparing it to the libraries listed below
Sorting:
- ☆22Nov 25, 2021Updated 4 years ago
- The official repository for Dynamic Clustering and Cluster Contrastive Learning (DCCC).☆14Dec 15, 2023Updated 2 years ago
- ☆13Oct 21, 2021Updated 4 years ago
- ☆45Sep 12, 2021Updated 4 years ago
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 5 months ago
- ☆23Feb 6, 2022Updated 4 years ago
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020.☆30Jun 28, 2020Updated 5 years ago
- ☆26Aug 14, 2022Updated 3 years ago
- BLOOM 模型的指令微调☆24Jun 15, 2023Updated 2 years ago
- Winning solution for the Kaggle Feedback Prize Challenge.☆66Sep 5, 2022Updated 3 years ago
- 数据合成工具,简单高效的合成不同业务场景的大模型训练数据☆39Jan 2, 2025Updated last year
- [TMM-2022] Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning, IEEE Transactions on Multimedia (T-MM), 2…☆29Jul 6, 2023Updated 2 years ago
- A Python Terminal script for displaying Corporate filings on BSE exchange.☆19Feb 28, 2024Updated last year
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆42Dec 14, 2022Updated 3 years ago
- A benchmark on predicting how small molecules change gene expression in different cell types.☆13Jul 4, 2025Updated 7 months ago
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs☆47Aug 26, 2024Updated last year
- ☆34Oct 30, 2020Updated 5 years ago
- 天池 新冠疫情相似句对判定大赛 top6方案☆77Jun 22, 2022Updated 3 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 6 months ago
- ☆14Jul 5, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 2 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- rule matcher (context free grammar)☆10Dec 27, 2019Updated 6 years ago
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- Human ID classification using mmwave radar point cloud☆13Oct 18, 2025Updated 3 months ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- ☆10May 1, 2025Updated 9 months ago
- ☆34Mar 22, 2021Updated 4 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆94Jun 8, 2022Updated 3 years ago
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- A novel incremental hierarchical clustering algorithm (KDD 22)☆10Aug 31, 2023Updated 2 years ago
- Template repository of a machine-learning Python project powered by FastAPI and PyTorch☆14Aug 26, 2021Updated 4 years ago
- ☆13Aug 11, 2024Updated last year
- ☆13May 7, 2023Updated 2 years ago
- code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"☆16Aug 15, 2022Updated 3 years ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- Code of the paper "Efficient-End-to-end-Diffusion-Model-for-Onestep-SAR-to-Optical-Translation"☆23Jan 4, 2026Updated last month
- ☆11Oct 17, 2024Updated last year