DPO-Shift: Shifting the Distribution of Direct Preference Optimization
☆59Mar 5, 2025Updated last year
Alternatives and similar repositories for DPO-Shift
Users who are interested in DPO-Shift are comparing it to the libraries listed below.
- ☆38Sep 15, 2025Updated 5 months ago
- The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion [AAAI'2…☆15Feb 2, 2026Updated last month
- [COLM 2025] Assessing Judging Bias in Large Reasoning Models: An Empirical Study https://openreview.net/pdf?id=SlRtFwBdzP☆163Sep 21, 2025Updated 5 months ago
- ☆16May 13, 2025Updated 9 months ago
- Align Anything: Training All-modality Model with Feedback☆4,635Nov 27, 2025Updated 3 months ago
- MegaRAG: Multimodal Graph-based RAG☆36Sep 16, 2025Updated 5 months ago
- Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale☆5,651Updated this week
- 💰The only genuine version💰 minerproxy: mining-pool fee skimming, mining-pool proxy, mining-pool relay, mining-pool…☆3,882Updated this week
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.☆3,150Dec 15, 2025Updated 2 months ago
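- Digital Base (数字底座) is a unified, secure management support platform for the digital transformation of large government bodies and enterprises, built on identity authentication, organizational structure, positions and duties, application systems, resource roles, data catalogs, security controls, and related functions. It is based on the three-administrator management model, offers microservices, multi-tenancy, containerization, and support for domestic (Chinese) technology stacks, lets users quickly build their own business applications with a code generator, and can also link to…☆2,574Feb 27, 2026Updated last week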
- ☆1,370Oct 9, 2024Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…☆18Sep 5, 2024Updated last year
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- ☆27Jun 18, 2025Updated 8 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated 11 months ago
- 🔥minerproxy: mining-pool fee skimming, mining-pool relay, built for mining-farm operations and maintenance☆3,246Jan 14, 2026Updated last month
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
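- FIT: an enterprise-grade AI development framework providing a multi-language function engine (FIT), a streaming orchestration engine (WaterFlow), and a LangChain alternative for the Java ecosystem (FEL). It runs in native or Spring mode, supports hot-pluggable plugins and intelligent aggregated or distributed deployment, and seamlessly unifies large models with business systems.☆2,102Feb 26, 2026Updated last week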
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- Lua-based Playwright control, making it easy for large models (ChatGPT, DeepSeek) to drive it☆33Mar 1, 2025Updated last year
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆21Feb 17, 2025Updated last year
- The next generation deep reinforcement learning toolkit☆3,462Jun 16, 2023Updated 2 years ago
- Application self-hosting and DevOps platform for running open-source software: a web-based Linux panel and lightweight PaaS☆2,085Feb 12, 2026Updated 3 weeks ago
- ☆517Feb 28, 2025Updated last year
- ☆33Jul 15, 2025Updated 7 months ago
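- Dive into the vast ocean of Spring and explore how Spring Boot integrates with various frameworks, from basic to advanced, covering Spring Boot, Spring Boot & Shiro, Spring Security Oauth2, Spring Cloud, and more; concise, easy-to-follow examples that show you Sprin…☆29Sep 1, 2023Updated 2 years ago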
- An interactive React 18 portfolio featuring AI-powered career assistance, dynamic project showcases with live previews, smooth Framer Mot…☆90Sep 13, 2025Updated 5 months ago
- AI-powered tool for efficient abstract and PDF screening in systematic reviews.☆1,304Feb 27, 2026Updated last week
- Run AI models end-to-end encrypted.☆3,065Feb 10, 2025Updated last year
- Test-time Scaling for VAR models☆31Sep 19, 2025Updated 5 months ago
- [NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,242Jan 16, 2026Updated last month
- https://hcv.boyuai.com☆999Dec 18, 2024Updated last year
- Wukong CRM (悟空CRM): a CRM system with separated front end and back end, based on the Spring Cloud Alibaba microservice architecture + vue ElementUI☆2,407Aug 27, 2021Updated 4 years ago
- [NeurIPS'25] SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning☆41Oct 14, 2025Updated 4 months ago
- ☆105Sep 18, 2025Updated 5 months ago
- [CVPR 2025] Official implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆31Feb 27, 2025Updated last year
- Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mo…☆7,741Feb 26, 2026Updated last week
- Official Repository for Critical Assessment of Protein Engineering(CAPE)☆86Sep 22, 2025Updated 5 months ago