Raibows/CREAM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Raibows/CREAM)

Raibows / CREAM

Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.

☆29

Alternatives and similar repositories for CREAM

Users that are interested in CREAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YiyangZhou / CSR
View on GitHub
[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models
☆87Oct 26, 2025Updated 8 months ago
mandyyyyii / east
View on GitHub
☆19Aug 4, 2025Updated 11 months ago
huaxiuyao / KGML
View on GitHub
KGML for EMNLP 2021
☆10Feb 2, 2022Updated 4 years ago
Kuaishou-OneRec / KSA
View on GitHub
Kwai Summary Attention
☆57May 8, 2026Updated 2 months ago
LAMDA-NeSy / Self-Backtracking
View on GitHub
☆52Feb 12, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
huaxiuyao / HSML_Dynamic
View on GitHub
HSML Dynamic version for ICML 2019
☆12Jul 11, 2019Updated 7 years ago
OscarXZQ / delta_activations
View on GitHub
Official code release for Delta Activations: A Representation for Finetuned Large Language Models
☆20Sep 5, 2025Updated 10 months ago
Raibows / Learn-to-Reason
View on GitHub
Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023
☆37Dec 12, 2023Updated 2 years ago
HKUST-KnowComp / NAACL
View on GitHub
The official codebase for our paper "NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems"
☆24Feb 28, 2026Updated 4 months ago
ytyz1307zzh / IHEval
View on GitHub
Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"
☆18Feb 25, 2025Updated last year
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago
Raibows / DynamicBatchSampler
View on GitHub
Yet another dynamic batch sampler for variable sequence data in PyTorch.
☆13Dec 9, 2021Updated 4 years ago
NUS-HPC-AI-Lab / Recurrent-Parameter-Generation
View on GitHub
The official implementation of Recurrent Diffusion for Large-Scale Parameter Generation.
☆81Sep 24, 2025Updated 9 months ago
Yui010206 / Adaptive-Visual-Imagination-Control
View on GitHub
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18Jun 2, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Oxen-AI / Self-Rewarding-Language-Models
View on GitHub
This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.
☆135Nov 16, 2024Updated last year
NeuSpeech / NeuGPT
View on GitHub
First neural GPT aligned with text and speech. Welcome to join us to make better foundation model in neural modality.
☆14Oct 30, 2024Updated last year
WPR001 / Ego-ST
View on GitHub
☆16Sep 25, 2025Updated 9 months ago
w-yibo / R1-Compress
View on GitHub
[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
☆17Jan 24, 2026Updated 5 months ago
OpenGVLab / TPO
View on GitHub
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
☆65Jul 22, 2025Updated 11 months ago
gl-ybnbxb / BoNBoN
View on GitHub
☆19Jun 3, 2024Updated 2 years ago
satori-reasoning / Satori
View on GitHub
[ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
☆114Jun 3, 2025Updated last year
beichenzbc / BoostStep
View on GitHub
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆37Jan 21, 2025Updated last year
Snowflake-Labs / agent-world-model
View on GitHub
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
☆412May 28, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
phantomzone-org / idash-2024-solution
View on GitHub
☆10Nov 1, 2024Updated last year
alexrs / herd
View on GitHub
Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
☆11Feb 11, 2024Updated 2 years ago
ranran0523 / SPECNN
View on GitHub
code repo for paper accepted in ICML 2023
☆13Oct 19, 2023Updated 2 years ago
GATECH-EIC / ACT
View on GitHub
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…
☆45Jun 30, 2024Updated 2 years ago
JiayuJeff / CostBench
View on GitHub
The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…
☆33Jun 14, 2026Updated last month
sparkle-reasoning / sparkle
View on GitHub
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
☆16Dec 12, 2025Updated 7 months ago
anhtuanhsgs / GitMerge3D
View on GitHub
[NeurIPS 2025] How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?
☆43Nov 21, 2025Updated 8 months ago
wxr99 / HolisticPU
View on GitHub
Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends [NeurIPS 2023]
☆10Jan 28, 2024Updated 2 years ago
aiming-lab / CITER
View on GitHub
[COLM'25] CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing
☆19Jun 25, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ventr1c / memma
View on GitHub
The official repository of "MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution".
☆19Mar 20, 2026Updated 4 months ago
THUDM / ReST-RL
View on GitHub
Reinforcing LLM Reasoning through Self-Training and Value-Guided Decoding
☆18May 6, 2026Updated 2 months ago
JiayuJeff / PlanBench-XL
View on GitHub
Official Repository for our paper: PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems
☆38Updated this week
huaxiuyao / ATS
View on GitHub
ATS for NeurIPS 2021
☆24Nov 4, 2021Updated 4 years ago
mengzaiqiao / awesome-natural-language-reasoning
View on GitHub
A collection of research papers related to Natural Language Reasoning
☆10May 27, 2022Updated 4 years ago
sail-sg / variational-reasoning
View on GitHub
Code for "Variational Reasoning for Language Models"
☆60Sep 29, 2025Updated 9 months ago
sail-sg / dice
View on GitHub
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆47Apr 15, 2025Updated last year