PPO algorithm implementation
☆41Jun 5, 2024Updated 2 years ago
Alternatives and similar repositories for RLHF_PPO
Users that are interested in RLHF_PPO are comparing it to the libraries listed below.
- ☆10Dec 10, 2023Updated 2 years ago
- Meta-learning-based Cold-Start Sequential Recommendation☆16May 25, 2021Updated 5 years ago
- ☆13Jul 26, 2021Updated 4 years ago
- ☆16Jun 23, 2021Updated 4 years ago
- Fine-tuning large language models with the DPO algorithm; simple and easy to get started.☆52Jul 3, 2024Updated last year
- Assignment code for the Advanced Computer Vision course at the University of Electronic Science and Technology of China☆13Sep 5, 2020Updated 5 years ago
- 🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters☆20Jul 5, 2024Updated last year
- [WIP]☆10Apr 8, 2017Updated 9 years ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆32Dec 9, 2025Updated 6 months ago
- [Findings of EMNLP'2024] Unified Active Retrieval for Retrieval Augmented Generation☆23Sep 30, 2024Updated last year
- An AI dialogue system based on Langchain-Chatchat and BERT-VITS2☆21Mar 20, 2024Updated 2 years ago
- 10th place solution for the RSNA Pneumonia Detection Challenge☆15Nov 9, 2018Updated 7 years ago
- ☆23Jan 22, 2025Updated last year
- NLP module for Golang☆13Jul 19, 2017Updated 8 years ago
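- MICCAI 2023 Challenges: STS, tooth segmentation on 2D panoramic images; write-up of the solution that placed 1st in the preliminary round and 4th in the second round☆24Sep 22, 2023Updated 2 years ago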
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators☆10Sep 7, 2015Updated 10 years ago
- Open-source strong baseline for domain generalization re-ID. We will update the strong baseline and CFD method.☆10Nov 30, 2021Updated 4 years ago
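- A one-click project that lets small-parameter large language models do role-play; everything from data construction to training is included in the project.☆26Mar 31, 2024Updated 2 years ago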
- A Python library for decoding and encoding AIS type 1 messages.☆11Jan 3, 2022Updated 4 years ago
- Domain-Adaptive Multibranch Networks☆14Nov 7, 2020Updated 5 years ago
- Group-Group Loss Based Global-Regional Feature Learning for Vehicle Re-Identification☆12May 10, 2022Updated 4 years ago
- ☆13May 28, 2021Updated 5 years ago
- Identify teeth in 3D using segmentation and labelling.☆28Jun 15, 2023Updated 2 years ago
- ☆11Oct 9, 2019Updated 6 years ago
- ☆13Jun 24, 2024Updated last year
- A simple and effective feature alignment method with proposed anchor loss for person re-identification☆15Aug 18, 2020Updated 5 years ago
- A Claude Code skill for structured, spec-driven development with phase-by-phase workflow and living documentation☆31Feb 16, 2026Updated 3 months ago
- TOP5 code for 2017 AI Challenger (Competition of Scene Classification)☆15Mar 4, 2018Updated 8 years ago
- Deep Supervised Hashing with Triplet Labels☆10Nov 2, 2017Updated 8 years ago
- Reactor example, requires java 8 + reactor-core 3.x☆10Oct 22, 2017Updated 8 years ago
- An elegant event-driven asynchronous network framework.☆11Dec 28, 2023Updated 2 years ago
- AXI-4 RAM Tester Component☆21Aug 5, 2020Updated 5 years ago
- Face recognition☆11Jun 20, 2019Updated 6 years ago
- (NeurIPS 2019) Combinatorial Inference against Label Noise☆11Jun 13, 2024Updated last year
- Temporal Lifting (TLift), a model-free temporal cooccurrence based score weighting method proposed in "Interpretable and Generalizable Pe…☆10Jul 24, 2020Updated 5 years ago
- Train Faster R-CNN on another dataset (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB v…☆17Apr 24, 2017Updated 9 years ago
- ☆12Jul 31, 2017Updated 8 years ago
- A basic implementation of the Bag of (Visual) Words approach (BoW) for image search.☆14Jul 9, 2014Updated 11 years ago
- ☆12Jul 30, 2016Updated 9 years ago