[TMLR] Triple Preference Optimization
☆30Feb 19, 2025Updated last year
Alternatives and similar repositories for TPO
Users that are interested in TPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Dec 14, 2024Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- ☆13Jun 4, 2024Updated last year
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated 2 years ago
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆15Jul 2, 2024Updated last year
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆14Aug 2, 2024Updated last year
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated last year
- ☆17Oct 31, 2023Updated 2 years ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆19May 25, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆20Oct 25, 2022Updated 3 years ago
- Implementation of OpenAI paper with Simple Noise Scale on Fastai V2☆19Apr 16, 2021Updated 5 years ago
- Crosslingual Reasoning through Test-Time Scaling☆19May 13, 2025Updated last year
- A General Quantum Software☆21May 17, 2026Updated last week
- ☆19Mar 31, 2024Updated 2 years ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆23Jun 15, 2025Updated 11 months ago
- ☆17Aug 13, 2024Updated last year
- This is the official implementation of our ICML 2024 paper "MultiMax: Sparse and Multi-Modal Attention Learning""☆22Feb 9, 2026Updated 3 months ago
- ☆11Jan 3, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Apr 28, 2021Updated 5 years ago
- Code for Q-learning with parametrized quantum circuits in OpenAI Gym environments.☆13Nov 12, 2021Updated 4 years ago
- ☆16Aug 7, 2020Updated 5 years ago
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆20Apr 17, 2026Updated last month
- My YAML cv and resume.☆18May 10, 2026Updated 2 weeks ago
- ☆14Apr 16, 2024Updated 2 years ago
- hierarchical multi-agent workflow for prompt optimazation☆14Jun 12, 2024Updated last year
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆39Apr 8, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official PyTorch implementation of RefRef: A Synthetic Dataset and Benchmark for Reconstructing Refractive and Reflective Objects☆15Mar 2, 2026Updated 2 months ago
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)☆30Dec 23, 2023Updated 2 years ago
- Official implementation of SimFlow☆31Dec 16, 2025Updated 5 months ago
- QF-based Hybrid DRL Portfolio Investment System☆14Aug 13, 2023Updated 2 years ago
- dxFeed candle charting demo web widget. This widget is built using the dxFeed Python API library and Dash framework and shows a 5-minute …☆16Sep 22, 2020Updated 5 years ago
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated last year
- ☆11Jun 11, 2025Updated 11 months ago