Direct Preference Optimization from scratch in PyTorch
☆130 · Apr 7, 2025 · Updated last year
Alternatives and similar repositories for Direct-Preference-Optimization
Users that are interested in Direct-Preference-Optimization are comparing it to the libraries listed below.
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge · ☆14 · Feb 20, 2024 · Updated 2 years ago
- Reference implementation for DPO (Direct Preference Optimization) · ☆2,886 · Aug 11, 2024 · Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face · ☆32 · Aug 5, 2023 · Updated 2 years ago
- Natural Universal Trigger Search (NUTS) · ☆21 · Apr 17, 2021 · Updated 5 years ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety · ☆62 · Jul 21, 2025 · Updated 10 months ago
- How to modify terrain in Isaac Gym, based on the legged_gym framework · ☆39 · Mar 22, 2025 · Updated last year
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra. · ☆10 · Jan 21, 2022 · Updated 4 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" · ☆34 · Dec 14, 2023 · Updated 2 years ago
- ☆27 · Sep 5, 2024 · Updated last year
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers". · ☆26 · Sep 19, 2024 · Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch · ☆18 · Sep 15, 2023 · Updated 2 years ago
- [AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple i… · ☆16 · Apr 16, 2025 · Updated last year
- FeatureAlignment = Alignment + Mechanistic Interpretability · ☆35 · Mar 8, 2025 · Updated last year
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper · ☆17 · Apr 19, 2024 · Updated 2 years ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO. · ☆331 · Jan 29, 2026 · Updated 4 months ago
- Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binari… · ☆16 · Aug 25, 2017 · Updated 8 years ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs · ☆223 · Nov 30, 2025 · Updated 6 months ago
- ☆12 · Dec 8, 2022 · Updated 3 years ago
- 知予 Artificial Intelligence: From Learner to Researcher · ☆13 · Jan 20, 2025 · Updated last year
- ☆27 · Oct 6, 2024 · Updated last year
- Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021 · ☆44 · Jul 23, 2021 · Updated 4 years ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their… · ☆22 · Jan 11, 2026 · Updated 5 months ago
- ☆17 · Dec 11, 2024 · Updated last year
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment" · ☆10 · May 5, 2024 · Updated 2 years ago
- Finetuning Stable Diffusion from Diffusers · ☆11 · Mar 11, 2024 · Updated 2 years ago
- A tool for udacity mentors to analyze the feedback they receive from their students. · ☆14 · Jul 10, 2022 · Updated 3 years ago
- Example code for fine-tuning gemma-3-1b-it to use tools. · ☆49 · Jul 27, 2025 · Updated 10 months ago
- Natural Language to Code · ☆14 · May 2, 2021 · Updated 5 years ago
- Collection of random notes, mostly transcribed from paper and mostly old. I take no responsibility for content! · ☆12 · Mar 27, 2020 · Updated 6 years ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018. · ☆19 · Jan 21, 2021 · Updated 5 years ago
- ☆16 · Jul 29, 2025 · Updated 10 months ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs · ☆16 · Dec 10, 2024 · Updated last year
- Emotion-Aware Dialogue Response Generation by Multi-Task Learning · ☆13 · Jan 22, 2022 · Updated 4 years ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023) · ☆15 · Jul 24, 2023 · Updated 2 years ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning" · ☆13 · Feb 5, 2024 · Updated 2 years ago
- ☆51 · Nov 22, 2024 · Updated last year
- PoS crypto coin over ipfs distributed storage network (with new consensus protocol 🙌) · ☆15 · Apr 10, 2024 · Updated 2 years ago
- Linter for LaTeX with useful commands for academic writing · ☆35 · Updated this week
- ☆16 · Mar 25, 2022 · Updated 4 years ago