flint-xf-fan/Federated-RLHF

[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The code contains experiments for training multiple instances of GPT-2 for personalized, sentiment-aligned text generation.

☆16

Alternatives and similar repositories for Federated-RLHF

Users that are interested in Federated-RLHF are comparing it to the libraries listed below.

xuxiyang1993 / Multi_MCTS_Guidance_Separation_Assurance
View on GitHub
☆11Oct 2, 2020Updated 5 years ago
INSPIRE-Lab-US / Byzantine-resilient-distributed-learning
View on GitHub
Associated codebase for Byzantine-resilient distributed / decentralized machine learning papers from INSPIRE Lab
☆14Oct 11, 2021Updated 4 years ago
L3030 / FedCyBGD
View on GitHub
The implementation of FedCyBGD
☆12Jul 19, 2024Updated 2 years ago
rui-ye / FedLLM-Bench
View on GitHub
☆121Aug 14, 2024Updated last year
JackKuo666 / a_numpy_based_implement_cnn
View on GitHub
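This is the code implementation for my blog post "Building a deep learning system for CIFAR-10 classification in Python with a NumPy-based convolutional neural network, without using a framework".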
☆10Jul 1, 2019Updated 7 years ago
WenkeHuang / SDEA
View on GitHub
ICML 2024 - Self-Driven Entropy Aggregation for Byzantine-Robust Heterogeneous Federated Learning
☆10Jul 16, 2024Updated 2 years ago
xiangqianL / ESPerHFL
View on GitHub
Personalized Client-Edge-Cloud Hierarchical Federated Learning on Non-IID Data
☆11Sep 7, 2023Updated 2 years ago
flint-xf-fan / MLDA-Workshop
View on GitHub
ML/DL training workshops for EEE undergrads
☆13Jan 16, 2019Updated 7 years ago
kurkale6ka / vim-swap
View on GitHub
Easy swapping of text in Vim
☆20Apr 6, 2020Updated 6 years ago
AsaadNA / Bank-Managment-System
View on GitHub
A simple management system made with 8086
☆10Jun 23, 2021Updated 5 years ago
b2a3e8 / jekyll-theme-console-demo-hacker
View on GitHub
A demo site for the jekyll-theme-console theme.
☆12Jun 23, 2026Updated last month
tianbingsz / SVRG
View on GitHub
Stochastic Variance Reduction Policy Gradient Estimation
☆11Nov 6, 2018Updated 7 years ago
YingqiLiu1999 / DFedPGP
View on GitHub
☆14Jan 3, 2025Updated last year
git-disl / Fed-CDP
View on GitHub
Gradient-Leakage Resilient Federated Learning
☆15Jul 25, 2022Updated 4 years ago
llm-eff / FedPepTAO
View on GitHub
☆33Mar 4, 2024Updated 2 years ago
lamccloskey / jekyll-theme-blogfolio
View on GitHub
A simple and easy to use blog and portfolio theme for Jekyll
☆10Apr 13, 2024Updated 2 years ago
yyy24601 / awesome-continual-learning
View on GitHub
Continual Learning: A Systematic Literature Review
☆17Nov 13, 2025Updated 8 months ago
PeixiLiu / humanMotionRadar
View on GitHub
Generate Micro-Doppler signature of human motion by radar
☆11Jul 2, 2023Updated 3 years ago
BUPT-ANTlab / PEPCRL-MVP
View on GitHub
☆17Oct 25, 2023Updated 2 years ago
JackqqWang / pfedHR
View on GitHub
☆12Oct 26, 2023Updated 2 years ago
epfml / topology-in-decentralized-learning
View on GitHub
Code related to "Beyond spectral gap: The role of the topology in decentralized learning".
☆14Jun 7, 2022Updated 4 years ago
nbbrd / jdemetra-nowcasting
View on GitHub
☆17Jun 3, 2024Updated 2 years ago
realDuang / codemux
View on GitHub
The first open-source GUI for GitHub Copilot CLI — a multi-engine AI coding client with zero-config secure remote access from any device.
☆29Jul 1, 2026Updated 3 weeks ago
flint-xf-fan / Byzantine-Federated-RL
View on GitHub
[NeurIPS2021] Federated Reinforcement Learning with Theoretical Guarantees. The repo contains code and experiments for our Federated Poli…
☆106Apr 16, 2025Updated last year
Jody7 / netfilter-firewall
View on GitHub
Netfilter firewall to capture and intercept packets on port 80. Performs simple analysis and packet editing.
☆12Jan 27, 2016Updated 10 years ago
Arise-zwy / FedVLMBench
View on GitHub
☆24Jun 3, 2026Updated last month
WUR-ABE / rl_drone_object_search
View on GitHub
UAV-based path planning for efficient localization of non-uniformly distributed weeds using prior knowledge: A reinforcement-learning app…
☆15Jul 1, 2025Updated last year
wujingda / Multi-Hug-RL
View on GitHub
(TPAMI) Human-guided Reinforcement Learning with Sim-to-real Transfer for Autonomous Navigation
☆26Sep 18, 2023Updated 2 years ago
HUANGLIZI / MMFundus
View on GitHub
This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.
☆13Feb 2, 2026Updated 5 months ago
facebookresearch / jailbreak-objectives
View on GitHub
Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"
☆37Jul 2, 2026Updated 3 weeks ago
brenda-Zheng / Exponential-Predefined-Time-Trajectory-Tracking-Control
View on GitHub
☆20Nov 21, 2023Updated 2 years ago
ducmngx / DDPG-UAV-Efficiency
View on GitHub
Using DDPG agent to control UAV system with energy efficiency
☆16Jan 7, 2023Updated 3 years ago
castacks / cvar-energy-risk-deep-model
View on GitHub
CVaR-based Flight Energy Risk Assessment for Multirotor UAVs using a Deep Energy Model
☆26Jul 27, 2023Updated 3 years ago
GKthom / DeepQnetworks
View on GitHub
MATLAB implementation of DQN for a navigation environment
☆13Aug 13, 2020Updated 5 years ago
junkangwu / Dr_DPO
View on GitHub
[ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"
☆19Jun 1, 2024Updated 2 years ago
Clin0212 / Awesome-Federated-LLM-Learning
View on GitHub
Latest Advances on Federated LLM Learning
☆111Jul 7, 2025Updated last year
EchoSafe-MLLM / EchoSafe
View on GitHub
[CVPR 2026] Code for Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
☆15Mar 18, 2026Updated 4 months ago
d-f / llm-summarization
View on GitHub
LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset
☆14Feb 2, 2025Updated last year
LPD-EPFL / ByzantineMomentum
View on GitHub
Distributed Momentum for Byzantine-resilient Stochastic Gradient Descent (ICLR 2021)
☆22May 6, 2021Updated 5 years ago