Implementation of RLHF (Reinforcement Learning with Human Feedback) and GAN (Generative Adversarial Network) on top of the T5 architecture.
☆16Jan 2, 2023Updated 3 years ago
Alternatives and similar repositories for T5-rlhf-pytorch
Users that are interested in T5-rlhf-pytorch are comparing it to the libraries listed below
Sorting:
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆35Feb 26, 2026Updated last week
- Tensorflow: lstm, seq2seq model☆17Jun 27, 2016Updated 9 years ago
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- Implementation of LSTM GAN for twitter posts generating.☆30Nov 4, 2016Updated 9 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- a Federated Learning Framework adapted for resource-constrained environments, focusing on IoT devices☆10Oct 6, 2025Updated 5 months ago
- Advanced Analytics data collection for M365 usage☆19Updated this week
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- My undergraduate final project - Modeling and control of a distillation column using neural networks and reinforcement learning.☆12Apr 28, 2020Updated 5 years ago
- [ICML 2024 Oral] Consistent Adversarial Robust Deep Q Networks (CAR-DQN)☆15Feb 27, 2025Updated last year
- extractor chinese synonyms in large corpus☆11Jul 20, 2016Updated 9 years ago
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- This operator will manage and configure data processing unit (DPUs) to be used in accelerating/offloading k8s networking functions☆12Feb 13, 2026Updated 3 weeks ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆10Jan 3, 2023Updated 3 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- Protect workers with TensorFlow Hard Hat object detection model on a Jetson Nano☆10Sep 27, 2022Updated 3 years ago
- ☆10May 1, 2025Updated 10 months ago
- Teaching the Donkey car to drive a track in the simulator using State Representation Learning and different Reinforcement Learning Algori…☆12Dec 6, 2021Updated 4 years ago
- ☆10Jul 26, 2024Updated last year
- An HTTP client for the Rust AWS SDK that runs on Fastly Compute @ Edge☆10Nov 11, 2025Updated 3 months ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…☆11Oct 8, 2021Updated 4 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Aug 14, 2023Updated 2 years ago
- reinforcement learning☆37Mar 20, 2018Updated 7 years ago
- This is the official implementation of "Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Le…☆38Apr 27, 2022Updated 3 years ago
- Multi-objective application placement in fog computing using graph neural network-based reinforcement learning☆10Oct 20, 2025Updated 4 months ago
- [IPSN 2024] Lifelong Intelligence Beyond the Edge using Hyperdimensional Computing☆13May 16, 2024Updated last year
- A simple DIY button that help you correct your console command and release stress XD☆11May 10, 2022Updated 3 years ago
- Various DQN method with cartpole☆11May 30, 2018Updated 7 years ago
- A CQRS implementation in nodeJS with promises.☆13Dec 1, 2017Updated 8 years ago
- A context-aware embedding similarity score☆11Aug 23, 2023Updated 2 years ago
- Backend.AI Client Library for Python☆10Sep 22, 2023Updated 2 years ago