Notes on Direct Preference Optimization
☆28 · Apr 14, 2024 · Updated 2 years ago
Alternatives and similar repositories for dpo-notes
Users that are interested in dpo-notes are comparing it to the libraries listed below.
- Distributed training (multi-node) of a Transformer model ☆99 · Apr 10, 2024 · Updated 2 years ago
- Notes on the Mistral AI model ☆21 · Dec 27, 2023 · Updated 2 years ago
- Notes and commented code for RLHF (PPO) ☆136 · Feb 27, 2024 · Updated 2 years ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration ☆11 · Dec 24, 2023 · Updated 2 years ago
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models" ☆16 · Dec 16, 2025 · Updated 6 months ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting ☆17 · Apr 15, 2025 · Updated last year
- https://www.coursera.org/learn/advanced-methods-reinforcement-learning-finance? ☆20 · Dec 26, 2021 · Updated 4 years ago
- Registration-aided 3D Point Cloud Learning for Large-Scale Place Recognition (IROS 2021) ☆11 · May 28, 2022 · Updated 4 years ago
- The Oxford RobotCar Facade dataset. ☆11 · Jun 4, 2022 · Updated 4 years ago
- [ICML 2024] DPZero: Private Fine-Tuning of Language Models without Backpropagation ☆17 · Sep 4, 2024 · Updated last year
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces) ☆183 · Jan 7, 2024 · Updated 2 years ago
- ☆15 · Feb 28, 2024 · Updated 2 years ago
- On the Robustness of Graph Neural Diffusion to Topology Perturbations ☆15 · Nov 4, 2022 · Updated 3 years ago
- ☆14 · Aug 7, 2023 · Updated 2 years ago
- High-Performance Embeddable Vector Database with Document Storage, Hybrid Search, and Filtering ☆82 · Jun 4, 2026 · Updated last month
- ☆14 · Nov 5, 2024 · Updated last year
- Collection of resources for RL and Reasoning ☆27 · Feb 3, 2025 · Updated last year
- CUHK-SZ CSC4180: Compiler Construction Course for Undergraduate Students ☆10 · May 11, 2025 · Updated last year
- Graph Neural Convection-Diffusion with Heterophily ☆11 · May 29, 2023 · Updated 3 years ago
- Large-scale Self-supervised Pre-training for Endoscopy ☆53 · Jun 11, 2024 · Updated 2 years ago
- AnyAccomp: Generalizable accompaniment generation for vocals and solo instruments, powered by a quantized melodic bottleneck. ☆38 · Dec 22, 2025 · Updated 6 months ago
- Creates subsets of ImageNet (e.g. ImageNet100) ☆13 · Feb 28, 2024 · Updated 2 years ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model ☆26 · Nov 25, 2025 · Updated 7 months ago
- ☆16 · Apr 28, 2023 · Updated 3 years ago
- ☆28 · Mar 11, 2025 · Updated last year
- EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation ☆25 · Oct 30, 2024 · Updated last year
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks ☆10 · Nov 27, 2024 · Updated last year
- Some LaTeX Tips for Writing Research Papers ☆10 · May 30, 2016 · Updated 10 years ago
- ☆25 · Mar 6, 2025 · Updated last year
- Library for simulating time progression in Python ☆16 · Aug 16, 2025 · Updated 10 months ago
- Tutorial on using Hugging Face's Vision Transformers for Image Classification ☆10 · Sep 4, 2021 · Updated 4 years ago
- Reference implementation of Mistral AI 7B v0.1 model. ☆28 · Dec 25, 2023 · Updated 2 years ago
- Some recipes for data engineering with Python ☆25 · Mar 23, 2021 · Updated 5 years ago
- This is a project to auto-deploy with an ML payload ☆23 · Oct 20, 2023 · Updated 2 years ago
- [NeurIPS 2025] Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling ☆25 · May 20, 2026 · Updated last month
- Notes on quantization in neural networks ☆130 · Dec 14, 2023 · Updated 2 years ago
- 3D Slicer extension for SegmentAnyBone developed by Mazurowski Lab ☆16 · Feb 25, 2026 · Updated 4 months ago
- Notes about LLaMA 2 model ☆75 · Aug 30, 2023 · Updated 2 years ago
- Automated testing tool to find logic bugs in graph database systems ☆21 · Oct 31, 2023 · Updated 2 years ago