PPO algorithm implementation
☆40Jun 5, 2024Updated 2 years ago
Alternatives and similar repositories for RLHF_PPO
Users that are interested in RLHF_PPO are comparing it to the libraries listed below.
- Long-text similarity model☆21Nov 24, 2023Updated 2 years ago
- DPO algorithm implementation☆53Jun 12, 2024Updated 2 years ago
- ☆14Aug 26, 2024Updated last year
- ☆10Dec 10, 2023Updated 2 years ago
- fastText with hierarchical softmax, implemented in TensorFlow☆19Jul 15, 2017Updated 8 years ago
- ☆16Jun 23, 2021Updated 5 years ago
- ☆16Mar 30, 2023Updated 3 years ago
- 🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters☆20Jul 5, 2024Updated last year
- [WIP]☆10Apr 8, 2017Updated 9 years ago
- [Findings of EMNLP'2024] Unified Active Retrieval for Retrieval Augmented Generation☆23Sep 30, 2024Updated last year
- The objective of this project is to utilize the IMDB data set to generate Meaningful and Interesting Insights and then create a movie rat…☆14May 21, 2018Updated 8 years ago
- LDA topic model for topic extraction and text classification☆26Jul 8, 2017Updated 8 years ago
- WebGL skybox/raytracer for relativistic phenomena. Currently black hole and Alcubierre warp drive bubble.☆22Apr 18, 2026Updated 2 months ago
- prompt engineering, LLM, text2sql☆39Oct 7, 2023Updated 2 years ago
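- Word segmentation implemented with CRF++ in Python☆37Jun 19, 2018Updated 8 years ago
- MLP-based detector for fake news on the internet☆15Oct 30, 2025Updated 8 months ago
- LR-based optimization methods: gradient descent, stochastic gradient descent, Newton's method, LBFGS, BFGS☆36Aug 24, 2017Updated 8 years ago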
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators☆10Sep 7, 2015Updated 10 years ago
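- Open-source strong baseline for domain generalization re-ID. We will update the strong baseline and CFD method~☆10Nov 30, 2021Updated 4 years ago
- A project that lets small-parameter large models do role-play with one click; everything from data construction to training is included in this project☆27Mar 31, 2024Updated 2 years ago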
- simple Conditional Random Field implementation in Python☆41Dec 15, 2017Updated 8 years ago
- large language model note☆26Jul 18, 2024Updated last year
- Domain-Adaptive Multibranch Networks☆14Nov 7, 2020Updated 5 years ago
- Lazy imports in python☆10Jun 19, 2015Updated 11 years ago
- Group-Group Loss Based Global-Regional Feature Learning for Vehicle Re-Identification☆12May 10, 2022Updated 4 years ago
- ☆13May 28, 2021Updated 5 years ago
- Code for "SUGAR: Subgraph Neural Network with Reinforcement Pooling and Self-Supervised Mutual Information Mechanism"☆10Apr 17, 2021Updated 5 years ago
- Teeth segmentation using UNet and a customized attention module☆26Feb 26, 2024Updated 2 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆14May 29, 2020Updated 6 years ago
- ☆11Oct 9, 2019Updated 6 years ago
- A simple and effective feature alignment method with proposed anchor loss for person re-identification☆15Aug 18, 2020Updated 5 years ago
- A2A Concept☆15Apr 10, 2025Updated last year
- A Feishu bot that pushes the latest arXiv articles daily.☆10Nov 28, 2021Updated 4 years ago
- TOP5 code for 2017 AI Challenger (Competition of Scene Classification)☆15Mar 4, 2018Updated 8 years ago
- The implementation of the NeurIPS2020 paper: The Dilemma of TriHard Loss and an Element-Weighted TriHard Loss for Person Re-Identificatio…☆10Oct 22, 2020Updated 5 years ago
- Deep Supervised Hashing with Triplet Labels☆10Nov 2, 2017Updated 8 years ago
- ☆81Apr 15, 2026Updated 2 months ago
- Face recognition☆11Jun 20, 2019Updated 7 years ago
- Very accessible code for my MSc thesis. Inexpensive quantization method for ANN search also known as Enhanced Residual VQ.☆14Jun 15, 2020Updated 6 years ago