leo-liuzy / CodeUpdateArenaLinks

☆13

Alternatives and similar repositories for CodeUpdateArena

Users that are interested in CodeUpdateArena are comparing it to the libraries listed below

Sorting:

Nanami18 / Snowballed_Hallucination
☆44Updated 9 months ago
zjysteven / mink-plus-plus
[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs
☆37Updated last week
PrasannS / rlhf-length-biases
☆28Updated last year
allenai / hyperdecoders
Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304
☆11Updated 2 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆38Updated 2 years ago
allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆22Updated 10 months ago
SimengSun / ChapterBreak
☆11Updated last year
nathanhu0 / CaMeLS
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
☆25Updated last year
shadowkiller33 / Contrast-Instruction
☆19Updated last year
declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆28Updated last year
yuzhaouoe / pretraining-data-packing
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆21Updated 9 months ago
chtmp223 / suri
Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)
☆22Updated 6 months ago
hitz-zentroa / lm-contamination
The LM Contamination Index is a manually created database of contamination evidences for LMs.
☆78Updated last year
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆43Updated last year
gmftbyGMFTBY / Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆31Updated last year
XiangLi1999 / AutoBencher
☆29Updated 10 months ago
abertsch72 / long-context-icl
Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"
☆35Updated 9 months ago
JasonForJoy / Model-Editing-Hurt
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆35Updated last week
eric-mitchell / serac
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
☆68Updated 2 years ago
SALT-NLP / chain-of-thought-bias
☆26Updated 8 months ago
kernelmachine / demix
DEMix Layers for Modular Language Modeling
☆53Updated 3 years ago
pillowsofwind / Course-Correction
[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"
☆19Updated 8 months ago
McGill-NLP / AdversarialTriggers
Code for "Universal Adversarial Triggers Are Not Universal."
☆17Updated last year
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆59Updated last year
niansong1996 / lever
Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)
☆87Updated last year
ejones313 / auditing-llms
☆54Updated 2 years ago
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆62Updated 10 months ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences
☆71Updated 11 months ago
CriticBench / CriticBench
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆25Updated last year
jzbjyb / lm-calibration
☆34Updated 3 years ago