This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)
☆18Jan 9, 2025Updated last year
Alternatives and similar repositories for SFT-and-DPO
Users that are interested in SFT-and-DPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated last year
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- ☆17Dec 7, 2025Updated 3 months ago
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'☆16Apr 24, 2025Updated 11 months ago
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆12Feb 27, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The code for “PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search”☆19Mar 13, 2024Updated 2 years ago
- Zeroth-Order Fine-Tuning of LLMs in Random Subspaces (ICCV 2025)☆18Nov 22, 2024Updated last year
- ☆17Nov 8, 2023Updated 2 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- 集中管理所有的prompt。☆14Nov 27, 2024Updated last year
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 7 months ago
- ☆16Jul 12, 2024Updated last year
- 🖖 图谱式笔记系统,旨在提高个人笔记的使用率!☆12Jan 17, 2021Updated 5 years ago
- 文本生成 - 通过商品参数和图片自动生成营销文本☆12Sep 17, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Crawled Wikipedia Tables with Passages☆13Aug 19, 2021Updated 4 years ago
- ☆27Dec 29, 2023Updated 2 years ago
- Toonification of real face images using PyTorch, Stylegan2 and Image-to-Image translation☆13Jun 14, 2022Updated 3 years ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆26Updated this week
- Object tracking based on SiamFC & DaSiamRPN using GOT-10k toolkit. Demo & Visualization.☆10Jun 29, 2020Updated 5 years ago
- ☆18Nov 22, 2025Updated 4 months ago
- ☆15Jan 21, 2025Updated last year
- A simple implementation of ReasonGenRM.☆19Apr 21, 2025Updated 11 months ago
- 基于deepsetAI的开源项目haystack进行修改,使其支持中文场景下的任务☆23Dec 11, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- I modified some code of K-BERT so that it can be fit to English datasets Topics Resources☆11Dec 15, 2022Updated 3 years ago
- For converting LLM datasets from one format into another.☆22Nov 12, 2025Updated 4 months ago
- This project implements optimizers for TensorFlow and Keras, which can be used in the same way as Keras optimizers. Machine learning, Dee…☆49Mar 11, 2026Updated 2 weeks ago
- Pretrained Language Model(from huggingface)을 사용하여 간단하게 비슷한 의미를 가지는 문장을 찾을 수 있는 metric을 제공☆13Jul 6, 2023Updated 2 years ago
- Traffic Light recognition using FasterRCNN in Pytorch☆11Jul 23, 2023Updated 2 years ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆30Jan 10, 2026Updated 2 months ago
- A simple modification on the official DETR codebase with support to Finetune on custom dataset☆14Nov 26, 2020Updated 5 years ago
- Data Science & Machine Learning Project applied to Healthcare☆16Dec 1, 2021Updated 4 years ago
- Image Segmentation On Custom Dataset Using YOLOv8☆19Jan 12, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 🧀 KoBART summarization using pytorch☆13Jun 7, 2023Updated 2 years ago
- 基于中文的营销文本生成,基于Pointer Generator Network和Converage的实现,此外还尝试各种文本数据增广和优化技巧☆18Sep 5, 2020Updated 5 years ago
- MSRSegNet: Multi-Scale Residual Network for Semantic Segmentation☆10Aug 9, 2018Updated 7 years ago
- Code for the Paper "ERASE: Benchmarking Feature Selection Methods for Deep Recommender Systems"☆30Jan 2, 2025Updated last year
- 复现Wav2Lip作者新的论文☆20Jun 20, 2023Updated 2 years ago
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Feb 2, 2025Updated last year
- 特征离散化方法总结☆18Nov 11, 2020Updated 5 years ago