Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
☆25Feb 23, 2024Updated 2 years ago
Alternatives and similar repositories for relative-preference-optimization
Users that are interested in relative-preference-optimization are comparing it to the libraries listed below
Sorting:
- ☆19Aug 4, 2025Updated 7 months ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated 10 months ago
- Repository for conditional transport☆15Jan 12, 2022Updated 4 years ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆31Feb 26, 2025Updated last year
- ☆20Oct 10, 2025Updated 4 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Apr 2, 2024Updated last year
- Latest Evaluation Toolkit (LatestEval). Assessing the language models with latest, uncontaminated materials.☆28Feb 17, 2025Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆69Aug 18, 2023Updated 2 years ago
- 本文提出了一种基于多视图卷积神经网络的三维物体识别算法,以实现三维物体的准确识别。首先实现一个标准的卷积神经网络架构,该架构经过训练可以独立地识别形状的渲染视图,以实现即使从单一视图中也可以识别出一个三维形状。随后使用该三维物体多个角度的二维视图通过卷积神经网络识别的结果进…☆11May 16, 2022Updated 3 years ago
- Physical Downlink Shared Channel (PDSCH) in 5G New Radio.☆12Jan 29, 2024Updated 2 years ago
- ☆30Dec 27, 2024Updated last year
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆54Apr 6, 2025Updated 10 months ago
- 这是我的博客《不用框架,使用Python搭建基于numpy的卷积神经网络来进行cifar-10分类的深度学习系统》的代码实现。☆10Jul 1, 2019Updated 6 years ago
- ☆11Mar 31, 2022Updated 3 years ago
- I have developed a custom environment using OpenAI Gym in Python for simulating a 5G wireless communication channel as part of a reinforc…☆13Mar 27, 2024Updated last year
- ☆12Apr 5, 2019Updated 6 years ago
- Dense Wireless Connectivity Datasets for the IoT.☆11Aug 13, 2019Updated 6 years ago
- [NeurIPS'23] Binary Classification with Confidence Difference☆10May 13, 2024Updated last year
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆16Mar 18, 2025Updated 11 months ago
- wifi☆12Jun 13, 2017Updated 8 years ago
- This project demonstrates how Low Density Parity Check (LDPC) Code and Multiple Input Multiple Output (MIMO) can be employed in Vehicular…☆14Jan 24, 2022Updated 4 years ago
- Master Thesis☆10Jan 28, 2023Updated 3 years ago
- Code for our paper "Performance Study on a CSMA/CA-Based MAC Protocol for Multi-User MIMO Wireless LANs"☆12Aug 31, 2019Updated 6 years ago
- AI-based Resource Provisioning of IoE Services in 6G: A Deep Reinforcement Learning Approach☆12Mar 31, 2021Updated 4 years ago
- A Federated Learning Method for Real-time Emotion State Classification from Multi-modal Streaming☆11Sep 15, 2022Updated 3 years ago
- A short review on beamforming algorithms (Phase Shift, MVDR, LCMV) on Phased Array Radar Systems. Created on MATLAB R2021b.☆12May 21, 2023Updated 2 years ago
- In this tutorial we will introduce software defined radios (SDR) and explore the application of machine learning (ML) to radio frequency …☆11Sep 27, 2022Updated 3 years ago
- Prototype of Python / GeoGebra interoperability☆17Feb 7, 2026Updated 3 weeks ago
- A Beginner's Python Guide for Data Analysis☆22Nov 5, 2019Updated 6 years ago
- Source code for ComNet paper: Satellite multi-beam multicast support for an efficient community-based CDN☆10Jul 26, 2022Updated 3 years ago
- Learning Environment-aware and hardware-compatible beam-forming codebooks☆15Mar 8, 2020Updated 5 years ago
- Go Board FPGA Project for Ambient Light Sensor in VHDL and Verilog☆10Apr 20, 2019Updated 6 years ago
- ☆43Sep 19, 2024Updated last year
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"☆41Sep 24, 2024Updated last year
- Python module/package to read, handle and operate on HDF5 files generated by Multi Channel Systems MCS GmbH software☆10May 19, 2022Updated 3 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Official repository for Targeted Unlearning with Single Layer Unlearning Gradient (SLUG), ICML 2025☆15Aug 10, 2025Updated 6 months ago
- DSP University Project - Matlab, Simulations, and Verilog Files☆13Jan 14, 2020Updated 6 years ago
- ☆15Mar 13, 2025Updated 11 months ago