owenliang/hf-ppo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/owenliang/hf-ppo)

owenliang / hf-ppo

Huggingface PPO Demo

☆30

Alternatives and similar repositories for hf-ppo

Users that are interested in hf-ppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

owenliang / qwen2.5-0.5b-grpo
View on GitHub
Qwen2.5 0.5B GRPO
☆86Feb 16, 2025Updated last year
titizheng / M3amba
View on GitHub
Implementation of "M3amba: Memory Mamba is All You Need for Whole Slide Image Classification". CVPR2025
☆11Feb 27, 2025Updated last year
JiehuiXie / PsychCoT-Tuning
View on GitHub
本项目对Deepseek-R1-Distill-Qwen-7B进行心理咨询CoT数据的LoRA微调，以进一步提升Deepseek-R1-Distill-Qwen-7B在心理咨询领域的慢思考能力。
☆12Mar 11, 2025Updated last year
ZhaozwTD / MMCAN
View on GitHub
Source code of "Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection"
☆14Nov 17, 2023Updated 2 years ago
shiivangii / Leveraging-Intra-and-Inter-Modality-Relationship-for-Multimodal-Fake-News-Detection
View on GitHub
☆10Apr 24, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
DIAGNijmegen / llm_extractinator
View on GitHub
This project enables the efficient extraction of structured data from unstructured text using large language models (LLMs). It provides a…
☆19Updated this week
peng-gao-lab / p4control
View on GitHub
P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF
☆11May 20, 2024Updated 2 years ago
Samyu0304 / thought-propagation
View on GitHub
Code and dataset for the ICLR 2024 paper "Thought Propagation: An analogical Approach to Complex Reasoning with Large Language Models."
☆16Mar 4, 2024Updated 2 years ago
HITSZ-NRSL / MSI-NeRF
View on GitHub
[WACV2025] Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field
☆14Nov 3, 2024Updated last year
chn-lee-yumi / MaterialSearch-core
View on GitHub
Core Python library for MaterialSearch project.
☆16Updated this week
shijian2001 / TemplateMatters
View on GitHub
A programmatic instruction template generator aiming at enhancing the understanding of the critical role instruction templates play in la…
☆15Dec 22, 2024Updated last year
SoonyangZhang / tcp-congestion-mininet
View on GitHub
test tcp congestion fairness on mininet
☆10Aug 18, 2020Updated 5 years ago
UESTC-GQJ / TieFake
View on GitHub
This is the source code of IJCNN 2023 paper TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection (TieFake).
☆16Dec 21, 2023Updated 2 years ago
rort1989 / BH-HSMM
View on GitHub
Bayesian Hierarchical Hidden semi-Markov Model
☆11Aug 17, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
StevenHaojc / 3Dmedical_image_segment_pipline
View on GitHub
☆13Jan 23, 2024Updated 2 years ago
wadpac / hsmm4acc
View on GitHub
Behaviour detection in wearable movement sensor data
☆11Sep 4, 2019Updated 6 years ago
Littlefean / TimeManager
View on GitHub
一个时间管理类app项目，该app能够直观看到一周内自己把时间都花在了什么地方上。同时也可以很方便的记录时间。不仅可以管理时间，还可以记录经验，记录灵感，添加倒数日，添加周常事件。
☆15Jul 29, 2022Updated 3 years ago
juices6 / NLIN
View on GitHub
Natural Language-centered Inference Network for Multi-modal Fake News Detection
☆12Sep 23, 2024Updated last year
HUANGLIZI / TFCNs
View on GitHub
[ICANN 2022 Oral] This repository includes the official project of TFCNs, presented in our paper: TFCNs: A CNN-Transformer Hybrid Networ…
☆18Dec 20, 2022Updated 3 years ago
gaojingsheng / LAMM
View on GitHub
Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024
☆35Jan 3, 2024Updated 2 years ago
lzzppp / DERT
View on GitHub
MCAN
☆12Oct 11, 2025Updated 9 months ago
owenliang / nano-graphrag
View on GitHub
A simple, easy-to-hack GraphRAG implementation
☆15Sep 21, 2024Updated last year
huangmozhi9527 / GMMFormer
View on GitHub
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆21May 10, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Xjj020315 / qwen3-master
View on GitHub
☆167Jun 25, 2025Updated last year
LuminosityX / FNE
View on GitHub
Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..
☆20Dec 3, 2023Updated 2 years ago
congyuxiaoyoudao / cs231n
View on GitHub
Implementations for assignments of Stanford CS231n: Deep Learning for Computer Vision, Spring 2025
☆16Sep 8, 2025Updated 10 months ago
JiuyangDong / HDMIL
View on GitHub
☆19Aug 3, 2025Updated 11 months ago
owenliang / another-pytorch
View on GitHub
A simple deep learning framework inspired by Dezero and PyTorch
☆31Jan 27, 2025Updated last year
DeepExperience / MuSEAgent
View on GitHub
A Multimodal Reasoning Agent with Stateful Experiences
☆24Mar 31, 2026Updated 3 months ago
zhanll / UE5_NiagaraFluid
View on GitHub
☆16Dec 22, 2021Updated 4 years ago
HuiGuanLab / DL-DKD
View on GitHub
Source code of the paper Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval
☆19May 13, 2026Updated 2 months ago
529082819 / vue-crm
View on GitHub
非常简单但又全面的的一个vue2后台管理系统、满足开发一个后台系统的基本需求。
☆17Jul 31, 2017Updated 8 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ZYangChen / HART
View on GitHub
The official implementation of "Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer"
☆19Dec 9, 2025Updated 7 months ago
MengzSun / KDCN
View on GitHub
☆16Jul 15, 2022Updated 4 years ago
assissionZ / 3D-Virtual-Scene-Modeling
View on GitHub
基于opengl的3D虚拟场景建模，其中包含物体绘制、纹理贴图(天空盒)、Phong光照与阴影、视角切换(第一视角)等
☆12Dec 30, 2018Updated 7 years ago
FFarhangian / Fake-news-detection-Comparative-Study
View on GitHub
Official implementation of the paper : "All about automatic fake news detection: A wide comparative study"
☆14Sep 15, 2025Updated 10 months ago
chensi-cs / FastTrack4LLM
View on GitHub
FastTrack4LLM 是一个为大模型学习者准备的大模型学习与实践框架，帮助他们轻松掌握大模型的核心原理与训练流程，让每个人都能真正理解大模型的内部机制。本项目不仅完整复现了 LLaMA、Qwen、DeepSeek 等主流开源大模型架构，还覆盖了大模型的全生命周期：To…
☆31Nov 6, 2025Updated 8 months ago
gingasan / interactive-drama
View on GitHub
☆26Mar 4, 2025Updated last year
KunspireUp / workspace-easyjava
View on GitHub
mybatis generation tool | mybatis 生成工具
☆17Jul 3, 2026Updated 3 weeks ago