mfarisadip / T5-rlhf-pytorchView external linksLinks
Implementation of RLHF (Reinforcement Learning with Human Feedback) and GAN (Generative Adversarial Network) on top of the T5 architecture.
☆16Jan 2, 2023Updated 3 years ago
Alternatives and similar repositories for T5-rlhf-pytorch
Users that are interested in T5-rlhf-pytorch are comparing it to the libraries listed below
Sorting:
- Tensorflow: lstm, seq2seq model☆17Jun 27, 2016Updated 9 years ago
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- Implementation of LSTM GAN for twitter posts generating.☆30Nov 4, 2016Updated 9 years ago
- Advanced Analytics data collection for M365 usage☆19Jan 29, 2026Updated 2 weeks ago
- a Federated Learning Framework adapted for resource-constrained environments, focusing on IoT devices☆10Oct 6, 2025Updated 4 months ago
- ☆10Dec 10, 2021Updated 4 years ago
- Implementation of Dynamic Computation Offloading Control Logic in a Software-Defined Vehicle (SDV) System☆11Dec 19, 2024Updated last year
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…☆11Oct 8, 2021Updated 4 years ago
- My undergraduate final project - Modeling and control of a distillation column using neural networks and reinforcement learning.☆12Apr 28, 2020Updated 5 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 2 years ago
- [ICML 2024 Oral] Consistent Adversarial Robust Deep Q Networks (CAR-DQN)☆15Feb 27, 2025Updated 11 months ago
- This operator will manage and configure data processing unit (DPUs) to be used in accelerating/offloading k8s networking functions☆12Jan 10, 2026Updated last month
- extractor chinese synonyms in large corpus☆11Jul 20, 2016Updated 9 years ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆10Jan 3, 2023Updated 3 years ago
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 4 years ago
- An HTTP client for the Rust AWS SDK that runs on Fastly Compute @ Edge☆10Nov 11, 2025Updated 3 months ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- ☆10Jul 26, 2024Updated last year
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- CBLUE 2/3 任务实现☆10Aug 1, 2024Updated last year
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- ☆11Jan 11, 2022Updated 4 years ago
- Teaching the Donkey car to drive a track in the simulator using State Representation Learning and different Reinforcement Learning Algori…☆12Dec 6, 2021Updated 4 years ago
- Protect workers with TensorFlow Hard Hat object detection model on a Jetson Nano☆10Sep 27, 2022Updated 3 years ago
- ☆10May 1, 2025Updated 9 months ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 6 months ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Aug 14, 2023Updated 2 years ago
- reinforcement learning☆37Mar 20, 2018Updated 7 years ago
- This is the official implementation of "Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Le…☆37Apr 27, 2022Updated 3 years ago
- A simple DIY button that help you correct your console command and release stress XD☆11May 10, 2022Updated 3 years ago
- Closed-loop simulator of complex behavior and learning based on reinforcement learning and deep neural networks☆10Oct 21, 2025Updated 3 months ago
- Optimization of vehicle routing problem by deep reinforcement learning method based on residual edge-graph attention network☆16Dec 9, 2024Updated last year
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆11Dec 27, 2023Updated 2 years ago
- A bipedal humanoid control system using a Physics-Informed Neural Network (PINN) and Reinforcement Learning (RL) for stability and manipu…☆10Aug 15, 2024Updated last year
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago