wwxu21/CUT

Source code of "Reasons to Reject? Aligning Language Models with Judgments"

☆58

Alternatives and similar repositories for CUT

Users interested in CUT are comparing it to the repositories listed below.

wwxu21 / AMR-SG
☆20 · Sep 17, 2021 · Updated 4 years ago
DAMO-NLP-SG / Auto-Arena-LLMs
☆44 · Oct 7, 2024 · Updated last year
DAMO-NLP-SG / MT-LLaMA
Multi-Task instruction-tuned LLaMA
☆14 · May 5, 2023 · Updated 3 years ago
halfrot / ALaRM
[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"
☆25 · Mar 28, 2024 · Updated 2 years ago
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 · Feb 22, 2024 · Updated 2 years ago
DAMO-NLP-SG / CLEX
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
☆78 · Mar 12, 2024 · Updated 2 years ago
CriticBench / CriticBench
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31 · Mar 5, 2024 · Updated 2 years ago
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆66 · Jul 8, 2024 · Updated 2 years ago
DAMO-NLP-SG / RemeMo
[EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning
☆17 · Oct 31, 2023 · Updated 2 years ago
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆35 · Aug 15, 2024 · Updated last year
yifeiwang77 / Self-Correction
☆20 · Nov 3, 2024 · Updated last year
LeonCrashCode / InOrderParser
TACL 2017
☆27 · Nov 29, 2017 · Updated 8 years ago
gao-xiao-bai / JsonTuning
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
☆10 · Nov 3, 2024 · Updated last year
GAIR-NLP / OPO
☆50 · Mar 2, 2024 · Updated 2 years ago
LHRYANG / FSD
Implementation of the LREC-COLING 2024 paper "A Frustratingly Simple Decoding Method for Neural Text Generation"
☆19 · Feb 23, 2024 · Updated 2 years ago
ffaltings / InteractiveTextGeneration
☆34 · Mar 25, 2023 · Updated 3 years ago
GXimingLu / IPA
Codebase for Inference-Time Policy Adapters
☆25 · Nov 3, 2023 · Updated 2 years ago
KodCode-AI / code-r1
Reproducing R1 for Code with Reliable Rewards
☆13 · Apr 9, 2025 · Updated last year
purbeshmitra / MOTIF
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
☆17 · Jul 6, 2025 · Updated last year
yale-nlp / refdpo
☆16 · Jul 23, 2024 · Updated last year
sustcsonglin / disco-pointer
Official Implementation of ACL 2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14 · Aug 25, 2023 · Updated 2 years ago
sanyalsunny111 / LLM-Inheritune
[TMLR 2025] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models
☆126 · Mar 6, 2026 · Updated 4 months ago
Alignment-Lab-AI / Dataset-Conversion-Toolkit
A set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas…
☆20 · Mar 14, 2025 · Updated last year
DAMO-NLP-SG / CaRing
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
☆41 · Feb 15, 2024 · Updated 2 years ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆43 · Feb 27, 2025 · Updated last year
GAIR-NLP / LIMOPro
☆15 · May 27, 2025 · Updated last year
hkust-nlp / AgentVista
Benchmarking multimodal agents on realistic, ultra-challenging visual scenarios requiring long-horizon hybrid tool use.
☆65 · Mar 10, 2026 · Updated 4 months ago
deeplearning-wisc / args
☆47 · Feb 8, 2024 · Updated 2 years ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)
☆76 · Jun 25, 2024 · Updated 2 years ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆28 · Jul 13, 2026 · Updated last week
TingchenFu / MathIF
Instruction-following benchmark for large reasoning models
☆49 · Apr 19, 2026 · Updated 3 months ago
SihengLi99 / RePO
RePO: Replay-Enhanced Policy Optimization
☆23 · Jun 12, 2025 · Updated last year
sustcsonglin / pointer-net-for-nested
The official implementation of the ACL 2022 paper "Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks"
☆34 · Jan 12, 2023 · Updated 3 years ago
jcyk / copyisallyouneed
Code for our ACL 2021 paper "Neural Machine Translation with Monolingual Translation Memory"
☆81 · Jun 12, 2023 · Updated 3 years ago
JamyDon / PLM-based-CGEC-Model-Ensemble
[ACL 2023] Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction?
☆10 · Dec 15, 2025 · Updated 7 months ago
alibaba-damo-academy / EOCBench
[NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…
☆22 · Jun 17, 2025 · Updated last year
facebookresearch / multimodal_rewardbench
Multimodal RewardBench
☆68 · Feb 21, 2025 · Updated last year
izhx / uni-rep
Code for embedding and retrieval research.
☆16 · Oct 24, 2023 · Updated 2 years ago
liziniu / policy_optimization
Code for the paper "Policy Optimization in RLHF: The Impact of Out-of-preference Data"
☆29 · Dec 19, 2023 · Updated 2 years ago