Implementation of RLHF (Reinforcement Learning with Human Feedback) and GAN (Generative Adversarial Network) on top of the T5 architecture.
☆17Jan 2, 2023Updated 3 years ago
Alternatives and similar repositories for T5-rlhf-pytorch
Users that are interested in T5-rlhf-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- Python终端程序 获取实时股票数据☆13Mar 7, 2021Updated 5 years ago
- ☆10Oct 12, 2021Updated 4 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Mar 23, 2026Updated last week
- Build a Chinese conversational assistant robot with RASA(构建中文多轮任务型对话机器人)☆10Apr 1, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Jun 6, 2022Updated 3 years ago
- An optimized version of SeqGAN in pytorch☆12Apr 24, 2018Updated 7 years ago
- extractor chinese synonyms in large corpus☆11Jul 20, 2016Updated 9 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Hybrid List Aware Transformer Reranking☆19Oct 25, 2022Updated 3 years ago
- NinoLearn is a research framework for statistical ENSO prediction.☆10Jul 10, 2020Updated 5 years ago
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆15Feb 24, 2024Updated 2 years ago
- Tensorflow: lstm, seq2seq model☆17Jun 27, 2016Updated 9 years ago
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 8 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Conditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow☆49Dec 12, 2018Updated 7 years ago
- ☆17Oct 29, 2021Updated 4 years ago
- ☆18Aug 10, 2021Updated 4 years ago
- ☆15Aug 22, 2022Updated 3 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- Input method control scripts collection for Vim☆25Oct 24, 2018Updated 7 years ago
- ☆11Jan 15, 2021Updated 5 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- A TV videojs Player for Tizen/Webos. 一个电视机版本的videojs播放器☆10Dec 15, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Catalogue of Life toolkit for Python☆11Aug 4, 2020Updated 5 years ago
- Final Project- Microscopy Cell Segmentation, Deep Learning and Applications Course, EE Dep., Ben-Gurion University☆13Jul 10, 2018Updated 7 years ago
- easy python web crawler sharing☆19Jun 9, 2017Updated 8 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- Source Code for KDD'19 paper "SurfCon: Synonym Discovery on Privacy-Aware Clinical Data"☆10Apr 10, 2020Updated 5 years ago
- Classic deep neural network models for text matching, and implementation with tensorflow.☆12Apr 21, 2019Updated 6 years ago
- Stroke-based Character Reconstruction ---> https://arxiv.org/abs/1806.08990☆15Dec 6, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated last year
- A Wikipedia-based summarization dataset☆14Mar 27, 2023Updated 3 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 8 months ago
- Cleaned E2E NLG Challenge data + supporting scripts☆24Jan 19, 2021Updated 5 years ago
- ☆10May 1, 2025Updated 10 months ago
- Filter dialog data with a simple entropy-based method (see ACL paper)☆14Oct 4, 2019Updated 6 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago