ppo算法实现
☆39Jun 5, 2024Updated last year
Alternatives and similar repositories for RLHF_PPO
Users that are interested in RLHF_PPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jun 30, 2023Updated 2 years ago
- ☆10Dec 10, 2023Updated 2 years ago
- TensorRT for Yolov3-tiny by convert model to onnx file☆12May 24, 2019Updated 6 years ago
- ☆10Sep 29, 2017Updated 8 years ago
- Physics-Guided Reinforcement Learning System for Realistic Vehicle Active Suspension Control (IEEE ICMLA 2023)☆26Aug 19, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A pytorch version of Yoon Kim's work(reproduced the Kim's result)☆13Feb 4, 2018Updated 8 years ago
- Meta-learning-based Cold-Start Sequential Recommendation☆16May 25, 2021Updated 4 years ago
- ☆16Jun 23, 2021Updated 4 years ago
- Image dataset for zero-shot character identification.☆25Mar 16, 2021Updated 5 years ago
- High-fidelity simulator for off-road driving☆32Jun 6, 2024Updated last year
- 基于ppo算法的计算卸载策略研究☆29Jan 17, 2023Updated 3 years ago
- Pytorch DDP Traning Demo☆30Oct 20, 2024Updated last year
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆30Dec 9, 2025Updated 3 months ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于Langchain-Chatchat以及BERT-VITS2的AI对话系统☆21Mar 20, 2024Updated 2 years ago
- Evaluation for AI apps and agent☆44Jan 18, 2024Updated 2 years ago
- MICCAI 2023 Challenges :STS-基于2D 全景图像的牙齿分割任务 初赛第一 复赛第四方案分享☆24Sep 22, 2023Updated 2 years ago
- 这是一个一键让小参数大模型进行角色扮演的项目,从数据构成和训练都包含在这项目中☆25Mar 31, 2024Updated last year
- ☆22Jun 1, 2012Updated 13 years ago
- [ICCV 2021 Oral] Mining Latent Classes for Few-shot Segmentation☆75Sep 25, 2021Updated 4 years ago
- 基于百度AI 的图片搜索、以图搜图、相似图查找☆40Apr 14, 2023Updated 2 years ago
- Domain-Adaptive Multibranch Networks☆14Nov 7, 2020Updated 5 years ago
- Group-Group Loss Based Global-Regional Feature Learning for Vehicle Re-Identification☆12May 10, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- ☆11Oct 9, 2019Updated 6 years ago
- The official implementation for Sequential Recommendation with Latent Relations based on Large Language Model☆44Nov 3, 2025Updated 4 months ago
- The implementation of the NeurIPS2020 paper: The Dilemma of TriHard Loss and an Element-Weighted TriHard Loss for Person Re-Identificatio…☆10Oct 22, 2020Updated 5 years ago
- Tensorflow implementation of SoftTriple Loss: Deep Metric Learning Without Triplet Sampling☆10Sep 26, 2019Updated 6 years ago
- Face recognition☆11Jun 20, 2019Updated 6 years ago
- A library of hashing methods for ANN (Approximate Nearest Neighbor) search.☆14Mar 24, 2017Updated 9 years ago
- Very accessible code for my MSc thesis. Inexpensive quantization method for ANN search also known as Enhanced Residual VQ.☆14Jun 15, 2020Updated 5 years ago
- Localizing the the digits in images from the Google SVHN (Street View House Numbers) dataset☆11Jul 28, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- (NeurIPS 2019) Combinatorial Inference against Label Noise☆11Jun 13, 2024Updated last year
- Temporal Lifting (TLift), a model-free temporal cooccurrence based score weighting method proposed in "Interpretable and Generalizable Pe…☆10Jul 24, 2020Updated 5 years ago
- [NeurIPS2019] Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs☆14Jan 26, 2020Updated 6 years ago
- ☆14Sep 7, 2022Updated 3 years ago
- Repository for hosting the code for the CVPR 2020 paper Differentiable Adaptive Computation Time for Visual Reasoning.☆14Aug 26, 2020Updated 5 years ago
- ☆12Jul 31, 2017Updated 8 years ago
- Code to convert the phototour patches dataset from Brown et al. learning descriptors paper to torch-lua☆10Oct 17, 2017Updated 8 years ago