Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.
☆28Feb 17, 2025Updated last year
Alternatives and similar repositories for CREAM
Users that are interested in CREAM are comparing it to the libraries listed below
Sorting:
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆17Feb 25, 2025Updated last year
- ☆52Feb 12, 2025Updated last year
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆86Oct 26, 2025Updated 4 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)☆72Jun 26, 2025Updated 8 months ago
- ☆13Jan 14, 2026Updated last month
- Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt…☆25Jan 10, 2026Updated last month
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- ☆13Aug 28, 2024Updated last year
- ☆10Jun 17, 2021Updated 4 years ago
- Official Implementation of HIMA (COLM'25)☆19Nov 25, 2025Updated 3 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- Hierarchical Attention Network based Explainable Knowledge Tracing☆10May 18, 2022Updated 3 years ago
- Image reconstruction from human brain activity by VAE and adversarial learning☆12May 21, 2022Updated 3 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- [ICML 2025] Repository for M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Predictive Embedding Architecture☆18Nov 4, 2025Updated 3 months ago
- Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023☆12Aug 24, 2025Updated 6 months ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- 🕵️♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)☆17Feb 13, 2026Updated 2 weeks ago
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields"☆11Oct 2, 2023Updated 2 years ago
- Fine-tuning Llama2-7b and other llms for categorising emails for Deutsche Bahn (German National Railways)☆13Oct 9, 2023Updated 2 years ago
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 3 weeks ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆57Oct 10, 2025Updated 4 months ago
- 利用双目摄像头拍摄图像,完成了相机标定;立体匹配;深度图、点云图生成;平面分割;凹陷检测。训练了pytorch二分类模型,对完好/破损快递进行分类。☆11Oct 15, 2023Updated 2 years ago
- ☆15Dec 2, 2025Updated 3 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆46Jun 30, 2024Updated last year
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆108Jun 3, 2025Updated 8 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆116Feb 9, 2024Updated 2 years ago
- ☆10Nov 1, 2024Updated last year
- A collection of research papers related to Natural Language Reasoning☆11May 27, 2022Updated 3 years ago
- Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends [NeurIPS 2023]☆10Jan 28, 2024Updated 2 years ago
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 4 months ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆16Feb 15, 2025Updated last year
- First neural GPT aligned with text and speech. Welcome to join us to make better foundation model in neural modality.☆14Oct 30, 2024Updated last year
- ☆12Apr 12, 2024Updated last year
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization☆14Apr 21, 2025Updated 10 months ago