tmlr-group / Co-RewardLinks

Co-Reward: Self-supervised RL for LLM Reasoning via Contrastive Agreement

☆32

Alternatives and similar repositories for Co-Reward

Users that are interested in Co-Reward are comparing it to the libraries listed below

Sorting:

resistzzz / Co-Reward
Co-Reward: Self-supervised RL for LLM Reasoning via Contrastive Agreement
☆26Updated this week
tmlr-group / NoisyRationales
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆35Updated 3 weeks ago
EnnengYang / RepresentationSurgery
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆46Updated 10 months ago
ShuheSH / A-Survey-of-the-Reasoning-Abilities-of-LLMs
☆24Updated 5 months ago
Dongping-Chen / MLLM-Judge
[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.
☆78Updated 5 months ago
zzwjames / FailureLLMUnlearning
An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)
☆29Updated 5 months ago
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆88Updated 9 months ago
he-y / Multisize-Dataset-Condensation
Official PyTorch implementation of "Multisize Dataset Condensation" (ICLR'24 Oral)
☆14Updated last year
which47 / LLMCL
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning
☆34Updated 8 months ago
AI45Lab / REEF
The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…
☆59Updated 6 months ago
JasonForJoy / Model-Editing-Hurt
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆35Updated 2 months ago
ycjing / Awesome-Model-Merging
A curated list of Model Merging methods.
☆92Updated 10 months ago
tmlr-group / EOE
[ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"
☆12Updated 6 months ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆81Updated last month
sail-sg / Cheating-LLM-Benchmarks
[ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)
☆81Updated 9 months ago
arumaekawa / DiLM
Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".
☆22Updated 6 months ago
QingyangZhang / EMPO
EMPO, A Fully Unsupervised RLVR Method
☆56Updated last week
tanganke / weight-ensembling_MoE
Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"
☆29Updated last year
tmlr-group / G-effect
[ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"
☆12Updated 5 months ago
gzcch / Bingo
☆55Updated last year
cliang1453 / task-aware-distillation
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
☆35Updated last year
OpenKG-ORG / EasyDetect
An Easy-to-use Hallucination Detection Framework for LLMs.
☆60Updated last year
keven980716 / weak-to-strong-deception
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆13Updated last year
shenlei515 / VHL-paddle
translation of VHL repo in paddle
☆25Updated 2 years ago
SophieZheng998 / ALI-Agent
Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"
☆19Updated last week
deeplearning-wisc / haloscope
source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"
☆52Updated 4 months ago
Pbihao / SLM
☆28Updated last year
JayZhang42 / SLED
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433
☆28Updated 8 months ago
Ahren09 / AgentReview
Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."
☆83Updated 9 months ago
VITA-Group / Robust_Weight_Signatures
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16Updated 2 years ago