The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"
☆23Jun 26, 2025Updated 8 months ago
Alternatives and similar repositories for Weak-to-Strong-Decoding
Users that are interested in Weak-to-Strong-Decoding are comparing it to the libraries listed below
Sorting:
- ☆16Sep 17, 2024Updated last year
- ☆26Jul 29, 2025Updated 7 months ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated 9 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Jul 1, 2025Updated 8 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆22Feb 13, 2025Updated last year
- ☆15Nov 7, 2024Updated last year
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆24Jul 5, 2025Updated 7 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 4 months ago
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"☆21Feb 10, 2025Updated last year
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 4 months ago
- ☆20Oct 13, 2024Updated last year
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Nov 20, 2024Updated last year
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆45Jan 25, 2026Updated last month
- ☆19May 17, 2025Updated 9 months ago
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Oct 30, 2025Updated 4 months ago
- ☆58Feb 27, 2025Updated last year
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Sep 26, 2024Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆42Aug 25, 2025Updated 6 months ago
- ☆30Jan 18, 2026Updated last month
- ☆47Oct 2, 2025Updated 4 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- Code release for AccDiffusionV2 (TPAMI)☆35Nov 4, 2025Updated 3 months ago
- ☆35Feb 10, 2025Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ☆55Jul 7, 2025Updated 7 months ago
- ☆43Aug 15, 2025Updated 6 months ago
- ☆39Aug 4, 2025Updated 6 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".☆35Dec 6, 2025Updated 2 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- ☆18Jun 10, 2025Updated 8 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆36Nov 17, 2024Updated last year
- XmodelLM☆38Nov 19, 2024Updated last year