Notes on Direct Preference Optimization
☆25 · Apr 14, 2024 · Updated 2 years ago
Alternatives and similar repositories for dpo-notes
Users who are interested in dpo-notes are comparing it to the repositories listed below.
- Distributed training (multi-node) of a Transformer model ☆96 · Apr 10, 2024 · Updated 2 years ago
- Notes on the Mistral AI model ☆20 · Dec 27, 2023 · Updated 2 years ago
- Multi-modal Sarcasm Detection and Humor Classification in Code-mixed Conversations ☆13 · May 31, 2021 · Updated 4 years ago
- https://www.coursera.org/learn/advanced-methods-reinforcement-learning-finance? ☆20 · Dec 26, 2021 · Updated 4 years ago
- Academic Resources for the Courses at IIITD - Monsoon 2021 onwards ☆10 · Sep 23, 2023 · Updated 2 years ago
- Registration-aided 3D Point Cloud Learning for Large-Scale Place Recognition (IROS 2021) ☆11 · May 28, 2022 · Updated 3 years ago
- The Oxford RobotCar Facade dataset. ☆11 · Jun 4, 2022 · Updated 3 years ago
- My Academic Website: https://jerrysys.top ☆19 · Feb 21, 2025 · Updated last year
- ☆15 · Feb 28, 2024 · Updated 2 years ago
- ☆14 · Aug 7, 2023 · Updated 2 years ago
- Collection of resources for RL and Reasoning ☆27 · Feb 3, 2025 · Updated last year
- CUHK-SZ CSC4180: Compiler Construction Course for Undergraduate Students ☆10 · May 11, 2025 · Updated 11 months ago
- Graph Neural Convection-Diffusion with Heterophily ☆11 · May 29, 2023 · Updated 2 years ago
- Because it's there. ☆16 · Sep 22, 2024 · Updated last year
- ☆54 · Feb 19, 2025 · Updated last year
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model ☆27 · Nov 25, 2025 · Updated 5 months ago
- ☆14 · Feb 24, 2023 · Updated 3 years ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs" ☆46 · Feb 20, 2025 · Updated last year
- Direct Preference Optimization Implementation ☆17 · Feb 1, 2024 · Updated 2 years ago
- ☆17 · Jun 2, 2020 · Updated 5 years ago
- ☆21 · Mar 6, 2025 · Updated last year
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists ☆31 · Aug 14, 2025 · Updated 8 months ago
- CS Experience Labs' web application. ☆21 · Apr 17, 2026 · Updated 2 weeks ago
- Materials for demonstrating video model deployment ☆17 · Jun 14, 2020 · Updated 5 years ago
- Some recipes for data engineering with Python ☆25 · Mar 23, 2021 · Updated 5 years ago
- Notes on quantization in neural networks ☆124 · Dec 14, 2023 · Updated 2 years ago
- (AAAI 2024) DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition ☆26 · Apr 15, 2024 · Updated 2 years ago
- Adversarial Robustness in Graph Neural Networks: A Hamiltonian Energy Conservation Approach ☆16 · Apr 27, 2024 · Updated 2 years ago
- Tutorial Kubernetes Operator ☆16 · Aug 14, 2021 · Updated 4 years ago
- ☆14 · May 9, 2024 · Updated last year
- ☆12 · Mar 4, 2025 · Updated last year
- Azure DevOps workflow for ML ☆20 · Mar 29, 2023 · Updated 3 years ago
- Helper functions for processing and integrating visual language information with Qwen-VL Series Model ☆18 · Aug 30, 2024 · Updated last year
- Understanding how features learned by neural networks evolve throughout training ☆41 · Oct 24, 2024 · Updated last year
- Solve puzzles. Learn CUDA. ☆62 · Dec 13, 2023 · Updated 2 years ago
- ☆17 · May 6, 2025 · Updated 11 months ago
- ACL24 ☆11 · Jun 7, 2024 · Updated last year
- A RAG that can scale 🧑🏻💻 ☆11 · May 28, 2024 · Updated last year
- LLaMA 2 implemented from scratch in PyTorch ☆369 · Sep 25, 2023 · Updated 2 years ago