Joyce94/LLM-RLHF-Tuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Joyce94/LLM-RLHF-Tuning)

Joyce94 / LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

☆450

Alternatives and similar repositories for LLM-RLHF-Tuning

Users that are interested in LLM-RLHF-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aa12gq / goxenith
View on GitHub
Goxenith:『A12技术社区服务端』，基于Gin框架，采用cobra、viper、zap、ent、proto、redis、mysql、sqlite、email和jwt等多种技术栈，以实现高性能的Web应用开发、强大的命令行工具、灵活的配置管理、高效的日志记录、可靠的…
☆128Sep 14, 2023Updated 2 years ago
18148764734 / YunKePlayer
View on GitHub
一款炒鸡强大的mp4视频切片弹幕流式播放器！
☆240Feb 29, 2024Updated 2 years ago
zhenruyan / WSL-libre-linux-kernel
View on GitHub
Installing a 100% libre(free) linux kernel for wsl,It is possible to celebrate freedom within a cell. 给WSL替换自由内核!!!
☆220Jun 1, 2026Updated last month
xxsoftware / blog
View on GitHub
☆232May 1, 2025Updated last year
rawchen / feishu-bot
View on GitHub
飞书群聊/私聊ChatGPT机器人 - Spring Boot
☆159Nov 27, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
RobertWeijie / IS-KnowledgeBase
View on GitHub
来华留学生共享库 Foreign Students in China
☆119Jul 5, 2023Updated 2 years ago
ihandmine / anti-useragent
View on GitHub
fake pc or app browser useragent, anti useragent, and other awesome tools
☆215Sep 19, 2022Updated 3 years ago
mantoufan / yzhanProxy
View on GitHub
Web reverse proxy with automatic SSL, LFU caching, command-line configuration 支持自动配置 SSL 证书、LFU 缓存、用命令行配置的 Web 反向代理服务器
☆272May 30, 2023Updated 3 years ago
synbol / Kaggle-Contests
View on GitHub
Kaggle Solutions: A Record of My Kaggle & Other Contests Solving Journey.
☆86Sep 1, 2023Updated 2 years ago
flowhub-team / WholeGenomeSequencing-WGS
View on GitHub
Whole Genome Sequencing analysis, WGS analysis
☆245Jul 5, 2023Updated 2 years ago
redvelvets / qrpc
View on GitHub
一个关于自定义 RPC 框架的设计和实现
☆104Aug 15, 2023Updated 2 years ago
CL-lau / SQL-GPT
View on GitHub
Use ChatGPT to generate SQL and perform execution. Optimization and error correction of SQL is also possible.
☆345Mar 26, 2024Updated 2 years ago
ChongQingNoSubway / PDL
View on GitHub
Code for the paper " PDL: Regularizing Multiple Instance Learning with Progressive Dropout Layers "
☆103Jul 11, 2024Updated last year
yangyuke001 / SD-inference
View on GitHub
Stable Diffusion inference
☆171Apr 22, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
gqylpy / gqylpy-dict
View on GitHub
`gqylpy-dict` is based on the built-in `dict` and serves as an enhancement to it. It can do everything the built-in `dict` can do, and ev…
☆268May 27, 2024Updated 2 years ago
vnjohn / sms-provider-gateway
View on GitHub
支持多种短信服务提供商且可自行扩展
☆153Jun 5, 2023Updated 3 years ago
Touch-Sun / t-dispatch
View on GitHub
T - 调度⏱️ 一款开发迅速、学习简单、轻量级、易扩展分布式任务调度平台
☆137Jul 13, 2023Updated 2 years ago
yj8023xx / xrpc
View on GitHub
A lightweight, high throughput, and low latency RPC framework that supports the RDMA protocol
☆221Sep 15, 2024Updated last year
XGraph-Team / XFlow
View on GitHub
XFlow - A Python Library for Graph Flow
☆145Jun 28, 2025Updated last year
hiyin / scopeplus
View on GitHub
Scope+: An open source generalizable architecture for single-cell atlases at sample and cell levels
☆108Oct 21, 2024Updated last year
mantoufan / yzhanHTMLParser
View on GitHub
A streaming HTML parser based on HTML Standard. 基于 HTML 标准的流式 HTML 解析器
☆97Sep 12, 2023Updated 2 years ago
SpenserCai / sd-webui-go
View on GitHub
This is a Go language version of the SDK based on stable-diffusion-webui. In your code, you can directly use the API interfaces of stable…
☆337Mar 25, 2024Updated 2 years ago
yangyuke001 / DriveGPT
View on GitHub
auto drive from GPT
☆191Jul 24, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ihandmine / aioscpy
View on GitHub
An asyncio + aiolibs crawler imitate scrapy framework
☆115Apr 18, 2025Updated last year
luguosong / programming-notes
View on GitHub
Notes compiled while learning programming.
☆357Jun 22, 2026Updated last week
carsontung666 / Live-streaming-app
View on GitHub
An online live streaming client developed, using DUILib and Agora SDK.
☆140Jul 6, 2024Updated last year
CL-lau / Knowledge-Background-Vector-Warehouse
View on GitHub
☆63Jul 17, 2023Updated 2 years ago
Yuanyuan-Yuan / CacheQL
View on GitHub
Research Artifact of USENIX Security 2023 paper: CacheQL: Quantifying and Localizing Cache Side-Channel Vulnerabilities in Production Sof…
☆173Jun 29, 2023Updated 3 years ago
burncloud / wei
View on GitHub
Wei is a cross-platform automation management tool designed to simplify and automate the installation, deployment, and management of soft…
☆13Oct 13, 2025Updated 8 months ago
TengHu / pyloom
View on GitHub
A event sourcing framework for building large language model applications
☆66Aug 15, 2023Updated 2 years ago
linkxzhou / SimpleBase
View on GitHub
LessDB a serverless SQLite service designed to simplify the use of cloud-based MySQL, PostgreSQL, and other databases.
☆363Nov 22, 2025Updated 7 months ago
dqzboy / K8sHA-Deploy
View on GitHub
Deploying Highly Available Kubernetes Cluster using Binary Installation | 采用二进制方式部署高可用 Kubernetes 集群
☆360Jul 26, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
RavelloH / NeutralPress
View on GitHub
基于 Next.js 构建的下一代动态 CMS 博客系统，可免费部署的一站式解决方案：可视化可拖拽页面编辑、所见即所得/Markdown/MDX内容支持、媒体管理、访问分析、照片墙、自动友链管理、无限层级评论、邮箱通知、实时私信、Github 项目展示、多用户多权限账号管理…
☆413Jun 23, 2026Updated last week
Patrick9313 / Multi-person-chatroom
View on GitHub
基于Python网络编程的多人聊天室
☆119Sep 12, 2023Updated 2 years ago
kqhasaki / glorzo
View on GitHub
创造属于自己的音乐播放器～
☆56Sep 7, 2023Updated 2 years ago
qrpcode / pptshow
View on GitHub
Java generates PPT documents and supports the new features of PPTX version 2010 / Java生成PPT文档，支持2010版PPTX新特性
☆398Jun 15, 2023Updated 3 years ago
kfggww / cutest
View on GitHub
A simple unit test framework for c programming language.
☆34Aug 19, 2023Updated 2 years ago
Correr-Zhou / RepMode
View on GitHub
[CVPR 2023 (Highlight)] Offical implementation of the paper "RepMode: Learning to Re-parameterize Diverse Experts for Subcellular Structu…
☆163Oct 12, 2023Updated 2 years ago
Ho-Tung / Anti-Fraud
View on GitHub
🔥🔥🔥 Anti Fraud && Telemarketing Scams
☆47Apr 22, 2024Updated 2 years ago