stellalisy / PrefPaletteView external linksLinks
☆21Jul 21, 2025Updated 6 months ago
Alternatives and similar repositories for PrefPalette
Users that are interested in PrefPalette are comparing it to the libraries listed below
Sorting:
- Fork of Flame repo for training of some new stuff in development☆19Jan 5, 2026Updated last month
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 7 months ago
- ☆14Mar 20, 2025Updated 10 months ago
- FlexiTokens☆19Dec 27, 2025Updated last month
- ☆23Sep 19, 2024Updated last year
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Jul 1, 2025Updated 7 months ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation☆15Aug 19, 2025Updated 5 months ago
- ☆14Oct 4, 2024Updated last year
- ☆19Jun 4, 2025Updated 8 months ago
- ☆19Aug 4, 2025Updated 6 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Oct 17, 2025Updated 3 months ago
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …☆14Aug 16, 2021Updated 4 years ago
- ☆16Apr 30, 2024Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Jun 5, 2025Updated 8 months ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated last year
- R3: Robust Rubric-Agnostic Reward Models☆20Jul 12, 2025Updated 7 months ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆26Nov 20, 2025Updated 2 months ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- ☆21May 3, 2025Updated 9 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 7 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆21Nov 9, 2025Updated 3 months ago
- ☆17Aug 1, 2025Updated 6 months ago
- Multimodal RewardBench☆61Feb 21, 2025Updated 11 months ago
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆25Dec 20, 2024Updated last year
- ☆60Jan 8, 2026Updated last month
- ☆29Nov 9, 2025Updated 3 months ago
- ☆45May 27, 2025Updated 8 months ago
- Code for paper "Analog Foundation Models"☆30Sep 18, 2025Updated 4 months ago
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆56Jan 27, 2025Updated last year
- Resa: Transparent Reasoning Models via SAEs☆47Sep 23, 2025Updated 4 months ago
- ☆47Oct 2, 2025Updated 4 months ago
- Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".☆74Jun 23, 2025Updated 7 months ago
- ☆67Mar 6, 2025Updated 11 months ago
- The repository contains code for Adaptive Data Optimization☆32Dec 9, 2024Updated last year
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆50Jan 5, 2026Updated last month
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 8, 2026Updated last week
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago