Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
☆13Feb 13, 2024Updated 2 years ago
Alternatives and similar repositories for refined-dpo
Users that are interested in refined-dpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python implementation of discrete optimal transport with a Tsallis entropy regularization.☆14Oct 23, 2023Updated 2 years ago
- ☆16Mar 12, 2024Updated 2 years ago
- Get more done with LLMs☆13Jan 19, 2024Updated 2 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated 3 weeks ago
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆12Aug 12, 2021Updated 4 years ago
- Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…☆10Apr 27, 2020Updated 5 years ago
- This repository accompaines the paper "Investigating Gender Fairness of Recommendation Algorithms in the Music Domain"☆15Jul 13, 2021Updated 4 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated last year
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ☆15Oct 25, 2021Updated 4 years ago
- A curated list of personalized Language model / Large language model (continually updated)☆10Nov 17, 2023Updated 2 years ago
- Faithfully Explainable Recommendation via Neural Logic Reasoning☆16May 3, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- ☆14Jun 28, 2022Updated 3 years ago
- ☆11Jul 13, 2018Updated 7 years ago
- Dynamic prompt☆28Aug 29, 2017Updated 8 years ago
- Supercharging Imbalanced Data Learning WithCausal Representation Transfer☆12Nov 29, 2021Updated 4 years ago
- ☆16Mar 27, 2023Updated 3 years ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- Investigating Cultural Alignment of Large Language Models☆13Aug 14, 2024Updated last year
- ☆18Dec 2, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- code for promptCSE, emnlp 2022☆11Apr 10, 2023Updated 3 years ago
- The official implementation of our paper "TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detec…☆33May 20, 2024Updated last year
- Run Deekseek LLM model locally with Ollama, deepseek-r1:1.5b, and React☆11Jan 29, 2025Updated last year
- ☆12Feb 18, 2020Updated 6 years ago
- Lambda Networks implemented in PyTorch☆13Feb 22, 2021Updated 5 years ago
- Dataset citeulike-t for 'Collaborative Topic Regression with Social Regularization' (CTRSR)☆18Jul 13, 2021Updated 4 years ago
- ☆10Nov 30, 2022Updated 3 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- How to connect to AirBears2 on Linux (Ubuntu, Linux Mint, Kubuntu)☆12Dec 2, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- the code of our paper "Beyond Matching: Modeling Two-Sided Multi-Behavioral Sequences For Dynamic Person-Job Fit" (实现十多个人岗匹配模型和动态人岗匹配模型的算…☆16Aug 10, 2023Updated 2 years ago
- ☆16Dec 8, 2020Updated 5 years ago
- Exprec records your experiments so you can compare different runs and easily reproduce results☆12Jan 6, 2026Updated 3 months ago
- Implementation of paper Long-Term Effect Estimation with Surrogate Representation☆14Oct 20, 2020Updated 5 years ago
- DSTC9 Submission☆16Apr 12, 2021Updated 5 years ago
- Youtube Too Long Didn't Watch☆13Sep 2, 2024Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 7 months ago