mfarisadip / T5-rlhf-pytorchLinks
Implementation of RLHF (Reinforcement Learning with Human Feedback) and GAN (Generative Adversarial Network) on top of the T5 architecture.
☆15Updated 2 years ago
Alternatives and similar repositories for T5-rlhf-pytorch
Users that are interested in T5-rlhf-pytorch are comparing it to the libraries listed below
Sorting:
- Hybrid List Aware Transformer Reranking☆18Updated 2 years ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Updated 2 years ago
- ☆34Updated 11 months ago
- huggingface ChineseBert Tokenizer☆15Updated 3 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Updated 2 years ago
- A span-based joint named entity recognition (NER) and relation extraction model.☆10Updated 4 years ago
- ☆35Updated last year
- Lite Self-Training☆29Updated last year
- Code for Document-level Entity-based Extraction as Template Generation (EMNLP 2021)☆29Updated 3 years ago
- Poly-encoder architecture and pre-training pipeline implementation (pytorch)☆15Updated 4 years ago
- The source code of paper "PAIR-LEVEL SUPERVISED CONTRASTIVE LEARNING FOR NATURAL LANGUAGE INFERENCE" at ICASSP 2022.☆48Updated 2 years ago
- ROUGE for multilingual Summarization☆24Updated 3 years ago
- reformer-pytorch中文版本,简单高效的生成模型。类似GPT2的效果☆16Updated last year
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Updated 2 years ago
- Dual Cross Encoder for Dense Retrieval☆16Updated 2 years ago
- ☆25Updated 4 years ago
- Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI]☆16Updated 3 years ago
- This is the repository of Heterogeneous Transformer with Sparse Attention forLong-Text Extractive Summarization☆16Updated 3 years ago
- 2020 阿里云天池大数据竞赛-中医药文献问题生成挑战赛☆27Updated 3 years ago
- pytorch版simcse无监督语义相似模型☆22Updated 4 years ago
- DescriptionPairsExtraction, entity and it's description pairs extract program based on Albert and data back-annotation. 基于Albert与结构化数据回标思…☆20Updated 3 years ago
- GIFT (ACL 2023) & MPC-BERT (ACL 2021) for Multi-Party Conversation Understanding☆41Updated last year
- A simple example for finetuning HuggingFace T5 model. Includes code for intermediate generation.☆27Updated 4 years ago
- Entity Linking within a Social Media Platform☆11Updated 6 years ago
- ☆32Updated 4 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Updated 2 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆59Updated 5 months ago
- ☆45Updated 3 years ago
- Codebase for DualEnc (ACL-20)☆22Updated last year
- RAN: Recurrent Attention Networks for Long-text Modeling | Findings of ACL23☆22Updated last year