This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)
☆18Jan 9, 2025Updated last year
Alternatives and similar repositories for SFT-and-DPO
Users that are interested in SFT-and-DPO are comparing it to the libraries listed below
Sorting:
- [🎖️1등(장관상) 솔루션] 2022 국립국어원 인공 지능 언어 능력 평가 (쇼핑몰 리뷰 데이터 속성 기반 감성 분석 : Aspect-Based Sentiment Analysis)☆11Jun 6, 2023Updated 2 years ago
- I love reinforcement learning.☆12Jan 15, 2025Updated last year
- ☆10Oct 23, 2017Updated 8 years ago
- 🎨 Convert to CSS filter be like using Hex, RGB or HSL☆11Apr 28, 2023Updated 2 years ago
- This piece of code employs GPA for face alignment☆10Jun 21, 2019Updated 6 years ago
- Extract the key frame from the tested video, and then search the most similar Images from the database, which consists over 1,4000 pictur…