[TMLR] Triple Preference Optimization
☆30Feb 19, 2025Updated last year
Alternatives and similar repositories for TPO
Users that are interested in TPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Flexible Toolkit for Dense Retrieval☆47Nov 12, 2025Updated 5 months ago
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated last year
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Extended Few-Shot Learning: Exploiting Existing Resources for Novel Tasks☆10Jul 6, 2021Updated 4 years ago
- ☆11May 28, 2024Updated last year
- ☆13Jun 26, 2024Updated last year
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated 2 years ago
- Code repository of AI-Endo☆16Jan 16, 2024Updated 2 years ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated last year
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- ☆20Oct 25, 2022Updated 3 years ago
- Implementation of OpenAI paper with Simple Noise Scale on Fastai V2☆19Apr 16, 2021Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Crosslingual Reasoning through Test-Time Scaling☆19May 13, 2025Updated 11 months ago
- 🧀 KoBART summarization using pytorch☆13Jun 7, 2023Updated 2 years ago
- ☆19Mar 31, 2024Updated 2 years ago
- ☆21May 27, 2025Updated 11 months ago
- ☆11Jan 3, 2024Updated 2 years ago
- Minimal coding, computer-use and deep research agents using the OpenAI Agents SDK☆35Mar 9, 2026Updated last month
- ☆13Apr 28, 2021Updated 5 years ago
- Code for Q-learning with parametrized quantum circuits in OpenAI Gym environments.☆13Nov 12, 2021Updated 4 years ago
- Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace☆16Mar 12, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JoinAI是一个开源仓库,专注于算法工程能力的培养,包括工程和数学原理的整理☆11Apr 20, 2025Updated last year
- ☆78Feb 22, 2024Updated 2 years ago
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆20Apr 17, 2026Updated 2 weeks ago
- The implementation of the paper: "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models"☆34Apr 11, 2024Updated 2 years ago
- My YAML cv and resume.☆18Mar 8, 2026Updated last month
- ☆14Apr 16, 2024Updated 2 years ago
- ☆14Oct 3, 2023Updated 2 years ago
- Useful resources to learn lifelong learning☆22Aug 15, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Mar 1, 2024Updated 2 years ago
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION☆35Jan 26, 2026Updated 3 months ago
- hierarchical multi-agent workflow for prompt optimazation☆14Jun 12, 2024Updated last year
- 记录点云SemanticKITTI论文阅读记录☆17Aug 30, 2021Updated 4 years ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆38Apr 8, 2023Updated 3 years ago
- Official PyTorch implementation of RefRef: A Synthetic Dataset and Benchmark for Reconstructing Refractive and Reflective Objects☆15Mar 2, 2026Updated 2 months ago
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)☆30Dec 23, 2023Updated 2 years ago