Paul33333 / SFT-and-DPO
This is a detailed code demo showing how to perform full-parameter Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO).
☆18Jan 9, 2025Updated last year
Alternatives and similar repositories for SFT-and-DPO
Users who are interested in SFT-and-DPO compare it to the repositories listed below.
- ☆10Oct 23, 2017Updated 8 years ago
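- Text generation: automatically generates marketing copy from product parameters and images☆12Sep 17, 2021Updated 4 years ago
- Natural language processing, CCF Big Data & Computing Intelligence Contest: intelligent discovery and graded classification of data content for data security governance☆11Nov 17, 2022Updated 3 years ago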
- I modified some of the K-BERT code so that it works with English datasets☆11Dec 15, 2022Updated 3 years ago
- Extract the key frame from the tested video, and then search the most similar Images from the database, which consists over 1,4000 pictur…☆10Mar 13, 2014Updated 11 years ago
- This is a submission example for CelebA-Spoof Challenge participants.☆10Sep 8, 2020Updated 5 years ago
- Data Science & Machine Learning Project applied to Healthcare☆15Dec 1, 2021Updated 4 years ago
- This piece of code employs GPA for face alignment☆10Jun 21, 2019Updated 6 years ago
- Training PyTorch Faster-RCNN on custom dataset☆14Jun 2, 2021Updated 4 years ago
- Object tracking based on SiamFC & DaSiamRPN using GOT-10k toolkit. Demo & Visualization.☆10Jun 29, 2020Updated 5 years ago
- Apply filtering-based state estimation to determine the pose of a vehicle on the roadway☆11May 4, 2020Updated 5 years ago
- online hard examples mining support for Faster R-CNN end to end.☆11Aug 22, 2017Updated 8 years ago
- ☆12Feb 22, 2023Updated 2 years ago
- Image stitching and 3D point cloud registration using a Kinect camera☆11Sep 9, 2020Updated 5 years ago
- MSRSegNet: Multi-Scale Residual Network for Semantic Segmentation☆10Aug 9, 2018Updated 7 years ago
- Pytorch(0.4.1/1.0 verified) codes and pre-trained models for the paper: Seesaw-Net: Convolution Neural Network With Uneven Group Convolut…☆10Dec 15, 2019Updated 6 years ago
- A simple implementation of LoRA+: Efficient Low Rank Adaptation of Large Models☆10Mar 20, 2024Updated last year
- Implementation of BEVFusion [ICRA 2023] using the SimBEV dataset.☆16Feb 6, 2026Updated last week
- ☆14Jul 16, 2021Updated 4 years ago
- ☆12Jun 18, 2019Updated 6 years ago
- Provides a metric for easily finding sentences with similar meaning using a pretrained language model (from Hugging Face)☆13Jul 6, 2023Updated 2 years ago
- [IJCNN'19, IEEE JSTSP'19] Caffe code for our paper "Structured Pruning for Efficient ConvNets via Incremental Regularization"; [BMVC'18] …☆14Feb 14, 2020Updated 6 years ago
- Image Segmentation On Custom Dataset Using YOLOv8☆19Jan 12, 2023Updated 3 years ago
- Jupyter notebooks for course Finetuning Large Language Models, taught by Sharon Zhou (Lamini) and Andrew Ng (DeepLearning.AI).☆16Oct 21, 2023Updated 2 years ago
- ☆12Sep 29, 2017Updated 8 years ago
- Log☆11Nov 8, 2021Updated 4 years ago
- Annotation builder to use segmentation in Mask_RCNN, even if your annotations are rectangular instead of polygon.☆15Feb 16, 2022Updated 4 years ago
- Simulation of a self-driving car game using a Deep Q Learning AI☆16Oct 15, 2017Updated 8 years ago
- Caffe implementation of Dynamic Network Surgery and Incremental Network Quantization☆15Dec 13, 2017Updated 8 years ago
- Chinese marketing text generation, implemented with Pointer Generator Network and Coverage; also tries various text data augmentation and optimization techniques☆19Sep 5, 2020Updated 5 years ago
- A semi-weakly supervised object detection technique based on monte carlo sampling for pseudo GT boxes☆12Apr 10, 2022Updated 3 years ago
- ICME 2017 "Learning a Multi-Center Convolutional Network for Unconstrained Face Alignment"☆16Mar 28, 2020Updated 5 years ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆19May 27, 2024Updated last year
- Python port to the normalizer in https://github.com/twitter/twitter-korean-text☆13Apr 26, 2016Updated 9 years ago
- ☆16Mar 6, 2020Updated 5 years ago
- ☆16Jul 18, 2024Updated last year
- ☆13Jun 3, 2020Updated 5 years ago
- Train a model to detect Chinese traffic signs and signals with tensorflow object detection API☆18Jan 25, 2018Updated 8 years ago
- A four-layer CNN model designed to estimate eye gaze or attention☆17Jan 8, 2018Updated 8 years ago