Notes on Direct Preference Optimization
☆24Apr 14, 2024Updated last year
Alternatives and similar repositories for dpo-notes
Users that are interested in dpo-notes are comparing it to the libraries listed below
Sorting:
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"☆15Dec 16, 2025Updated 2 months ago
- Multi-modal Sarcasm Detection and Humor Classification in Code-mixed Conversations☆13May 31, 2021Updated 4 years ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆18Apr 15, 2025Updated 10 months ago
- ☆18May 11, 2024Updated last year
- https://www.coursera.org/learn/advanced-methods-reinforcement-learning-finance?☆20Dec 26, 2021Updated 4 years ago
- ML algorithms implementations that are good for learning the underlying principles☆27Dec 7, 2024Updated last year
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists☆31Aug 14, 2025Updated 6 months ago
- ☆12Sep 21, 2023Updated 2 years ago
- Understanding how features learned by neural networks evolve throughout training☆41Oct 24, 2024Updated last year
- Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021.☆16May 1, 2024Updated last year
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- 🎹🎵🎶 A platform to make Original and Cover Visible and Valuable.☆13Nov 8, 2022Updated 3 years ago
- ☆12Oct 29, 2023Updated 2 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- An AI tool that turns ideas into act structures into fleshed-out stories using an iterate-refine loop. Built to illustrate what's readily…☆12Aug 21, 2023Updated 2 years ago
- Seldon Core Operator for Kubernetes☆13Nov 5, 2019Updated 6 years ago
- CAR-bench☆21Feb 23, 2026Updated last week
- Full List of Bad Words and Top Swear Words Banned by Google. As they closed the api☆12Sep 26, 2018Updated 7 years ago
- ☆10May 27, 2024Updated last year
- Prototype of TypeChat in Python☆11Oct 21, 2023Updated 2 years ago
- ☆14Jun 24, 2024Updated last year
- [ICLR 2026] Any-step Generation via N-th Order Recursive Consistent Velocity Field Estimation☆34Feb 4, 2026Updated 3 weeks ago
- ACL24☆11Jun 7, 2024Updated last year
- On the Robustness of Graph Neural Diffusion to Topology Perturbations☆16Nov 4, 2022Updated 3 years ago
- V2 of CodeGraphy. VSCode force-based graph extension for displaying file connections☆13Jun 10, 2023Updated 2 years ago
- Repo for MGraph project☆13Jan 10, 2026Updated last month
- Benchmarks for Business Document Foundation Models☆10Apr 4, 2024Updated last year
- ☆13May 9, 2024Updated last year
- Script for using Bing chat like a meal delivery service.☆12Mar 15, 2023Updated 2 years ago
- Video about NP-completeness, circuit SAT and "reversing time"☆15Aug 18, 2024Updated last year
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- ☆12Nov 1, 2023Updated 2 years ago
- ☆12Apr 24, 2024Updated last year
- Deep Learning with Multiple Objectives: 2021 edition☆10May 27, 2021Updated 4 years ago
- A pipeline for phylogenetic diversity analysis of GBIF-mediated data☆13May 30, 2025Updated 9 months ago
- ☆34Oct 7, 2025Updated 4 months ago
- 🤖📚 Telegram bot to convert and email PDFs, EPUBs or MOBIs to your Kindle☆11Sep 16, 2022Updated 3 years ago
- UniPrompt provides a unified interface to prompt optimization. We have distilled common functions from different algorithms and provide a…☆19May 20, 2025Updated 9 months ago
- Scalable Computation of Hessian Diagonals☆14Jun 2, 2024Updated last year