sssth / awesome-DPOView external linksLinks
papers related to Direct Preference Optimization(DPO)
☆19Jul 16, 2024Updated last year
Alternatives and similar repositories for awesome-DPO
Users that are interested in awesome-DPO are comparing it to the libraries listed below
Sorting:
- [ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optim…☆10Dec 19, 2023Updated 2 years ago
- Automating Sub-Agent Creation for Agentic Orchestration☆30Updated this week
- Github repo for Microsoft hackathon 2024 Nov - https://microsoftfabric.devpost.com/?ref_content=default&ref_feature=challenge&ref_medium=…☆11Dec 30, 2024Updated last year
- Image Background Removal and Replacement using Machine Learning and AI☆11Mar 21, 2023Updated 2 years ago
- Engineering degree thesis - Structured Light based 3D Scanner☆13Mar 15, 2017Updated 8 years ago
- The datasets of TSAD☆12Oct 20, 2025Updated 3 months ago
- ☆14Oct 3, 2024Updated last year
- Arduino Library for ADXL362 Micropower 3-axis accelerometer☆18Nov 18, 2022Updated 3 years ago
- [SIGIR'25] Code of "Generative Recommender with End-to-End Learnable Item Tokenization".☆23Apr 17, 2025Updated 9 months ago
- ICS_2020_PJ☆11Dec 25, 2020Updated 5 years ago
- ☆16Sep 5, 2023Updated 2 years ago
- ☆15May 22, 2025Updated 8 months ago
- The collection of related papers and resources for the paper Time Series Analysis for Education: Methods, Applications, and Future Direct…☆18Apr 12, 2025Updated 10 months ago
- a version of fast_Dreambooth by TheLastBen for kaggle notebook☆17Jun 1, 2023Updated 2 years ago
- Algebraic value editing in pretrained language models☆68Nov 1, 2023Updated 2 years ago
- Stores here are the source codes for the official implementation of "Generating Traffic Scenarios via In-Context Learning to Learn Better…☆21May 1, 2025Updated 9 months ago
- A client library for Rainbow Robotics' cobots☆16Dec 2, 2025Updated 2 months ago
- GPU-based Massively Parallel Environments for Large-Scale Combinatorial Optimization (CO) Problems Using Reinforcement Learning☆28Feb 6, 2026Updated last week
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆24Sep 26, 2024Updated last year
- ☆18Oct 8, 2024Updated last year
- A Survey of Direct Preference Optimization (DPO)☆91Jul 4, 2025Updated 7 months ago
- particle filter based object tracking☆17Mar 9, 2020Updated 5 years ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated 11 months ago
- ☆16Dec 2, 2018Updated 7 years ago
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated last year
- Official code implementation of SKU, Accepted by ACL 2024 Findings☆20Dec 18, 2024Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Upload an image of a document and extract text, names, facts and figures☆21Aug 12, 2024Updated last year
- Torchserve + TensorRT + Detection☆19Feb 16, 2022Updated 3 years ago
- ☆28Jul 16, 2024Updated last year
- [NeurIPS‘2021] "MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge", Geng Yuan, Xiaolong Ma, Yanzhi Wang et al…☆18Mar 16, 2022Updated 3 years ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆60Jan 28, 2026Updated 2 weeks ago
- ☆17Aug 1, 2025Updated 6 months ago
- ☆36Jul 2, 2025Updated 7 months ago
- ☆23Apr 14, 2024Updated last year
- 中国科学院大学研究生课程--自然语言处理☆21Jan 8, 2022Updated 4 years ago
- Python and OpenCV program to estimate Fundamental and Essential matrix between successive frames to estimate the rotation and the transla…☆22May 21, 2019Updated 6 years ago
- Official codes for paper: Autonomous Driving Scenario Generation via Reversely Regularized Hybrid Offline-and-Online Reinforcement Learni…☆24Nov 23, 2023Updated 2 years ago
- Code for the CVPR 2021 paper "Improved Handling of Motion Blur in Online Object Detection"☆23May 10, 2022Updated 3 years ago