Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs
☆19Mar 20, 2025Updated 11 months ago
Alternatives and similar repositories for DPO-VP
Users that are interested in DPO-VP are comparing it to the libraries listed below
Sorting:
- Perception Matters: Exploring Imperceptible and Transferable Anti-forensics for GAN-generated Fake Face Imagery Detection☆11Jan 23, 2023Updated 3 years ago
- 2024年第六届全球校园人工智能算法精英大赛AI生成人脸图像鉴别☆15May 30, 2025Updated 9 months ago
- Official repository for the paper "On the use of Benford's law to detect GAN-generated images", ICPR2020☆13Apr 7, 2021Updated 4 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- Multimodal Federated Learning on IoT Data☆11Dec 17, 2023Updated 2 years ago
- Code for paper: Reinforced Vision Perception with Tools☆71Oct 3, 2025Updated 5 months ago
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆37May 30, 2025Updated 9 months ago
- Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models.☆14Mar 18, 2024Updated last year
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 8 months ago
- ☆21Jul 8, 2025Updated 7 months ago
- A-Soul-Data Json数据存放☆13Sep 17, 2022Updated 3 years ago
- ☆12Jul 24, 2024Updated last year
- ☆13May 15, 2025Updated 9 months ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- Official code for the paper "Adversarial Magnification to Deceive Deepfake Detection through Super Resolution"☆12Jun 26, 2023Updated 2 years ago
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆22Sep 23, 2025Updated 5 months ago
- The official implementation of paper "Overcoming Data and Model heterogeneities in Decentralized Federated Learning via Synthetic Anchors…☆14Jun 14, 2024Updated last year
- Secure and Scalable Federated Learning using Serverless Computing☆12Jan 31, 2024Updated 2 years ago
- Code accompanying the 2022 DLS paper "Misleading Deep-Fake Detection with GAN Fingerprints"☆10May 26, 2022Updated 3 years ago
- IPO: Interpretable Prompt Optimization for Vision-Language Models(NeurIPS 2024)☆15Mar 4, 2025Updated last year
- ☆10Jul 24, 2023Updated 2 years ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- Official repository of "Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection" [ICCV 2025]☆20Jan 17, 2026Updated last month
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- [AAAI 2025 Oral] ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks https://arxiv.org/…☆10Jun 25, 2025Updated 8 months ago
- ☆12Aug 16, 2018Updated 7 years ago
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- Liu, Zichuan, et al. "Multi-View Spatial-Temporal Model for Travel Time Estimation." Proceedings of the 29th International Conference on …☆12Nov 24, 2021Updated 4 years ago
- [ICCV 2025 Highlight] LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs☆20Nov 16, 2025Updated 3 months ago
- Robust Camera Trace Extraction (TIFS'23)☆12Oct 3, 2023Updated 2 years ago
- [EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples☆11Apr 6, 2023Updated 2 years ago
- Sparse-RS: a versatile framework for query-efficient sparse black-box adversarial attacks☆46Feb 24, 2022Updated 4 years ago
- TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice☆22Nov 24, 2025Updated 3 months ago
- Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)☆11Sep 16, 2025Updated 5 months ago
- ☆20Sep 23, 2025Updated 5 months ago
- [ACL 2025 Main] Open-source toolkit for automatic evaluation of text-to-image generation task, including training & test datasets and a d…☆16Jul 5, 2025Updated 7 months ago
- Official implementation for P2SAM (ACM MM 2024)☆14Dec 7, 2024Updated last year
- ☆16Mar 1, 2025Updated last year
- AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection☆25Feb 2, 2026Updated last month