normster/llm_rules

RuLES: a benchmark for evaluating rule-following in language models

☆255

Alternatives and similar repositories for llm_rules

Users who are interested in llm_rules are comparing it to the libraries listed below.

aypan17 / reward-misspecification
☆10 · Mar 13, 2023 · Updated 3 years ago
ash-01xor / bpe.c
Simple byte pair encoding mechanism used for tokenization, written purely in C
☆151 · Nov 11, 2024 · Updated last year
abacaj / fine-tune-mistral
Fine-tune Mistral-7B on 3090s, A100s, H100s
☆734 · Oct 11, 2023 · Updated 2 years ago
isafulf / inbox_cleaner
A Python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.
☆465 · Dec 2, 2023 · Updated 2 years ago
carlini / yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
☆1,061 · Apr 27, 2025 · Updated last year
chawins / pal
PAL: Proxy-Guided Black-Box Attack on Large Language Models
☆57 · Aug 17, 2024 · Updated last year
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆179 · Jun 23, 2025 · Updated last year
aogara-ds / hoodwinked-website
A text-based game where language models learn to lie and to detect lies.
☆12 · Oct 4, 2023 · Updated 2 years ago
HumanCompatibleAI / tensor-trust-data
Dataset for the Tensor Trust project
☆49 · Mar 17, 2024 · Updated 2 years ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48 · Jan 17, 2024 · Updated 2 years ago
teichman / teichman-ros-pkg
☆10 · Sep 30, 2015 · Updated 10 years ago
belindal / LaMPP
Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action
☆37 · Apr 3, 2023 · Updated 3 years ago
stevemayne / gplusblog
Google+ Blog
☆15 · Oct 9, 2011 · Updated 14 years ago
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆28 · Sep 17, 2024 · Updated last year
meta-pytorch / gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
☆6,228 · Aug 22, 2025 · Updated 10 months ago
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆323 · Dec 20, 2023 · Updated 2 years ago
liyucheng09 / LatestEval
Latest Evaluation Toolkit (LatestEval). Assessing language models with the latest, uncontaminated materials.
☆29 · Feb 17, 2025 · Updated last year
ethz-spylab / rlhf_trojan_competition
Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.
☆119 · Jun 13, 2024 · Updated 2 years ago
AsaCooperStickland / situational-awareness-evals
Measuring the situational awareness of language models
☆41 · Feb 12, 2024 · Updated 2 years ago
storborg / glass-teardown
Teardown of Google Glass
☆40 · Jan 11, 2014 · Updated 12 years ago
Codium-ai / AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"
☆3,945 · Nov 25, 2024 · Updated last year
paul-rottger / xstest
Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
☆138 · Feb 24, 2025 · Updated last year
isle-dev / MetricEval
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12 · Nov 6, 2023 · Updated 2 years ago
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆20 · May 24, 2024 · Updated 2 years ago
sail-sg / Cheating-LLM-Benchmarks
[ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)
☆86 · Oct 23, 2024 · Updated last year
BobMcDear / attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
☆606 · May 13, 2026 · Updated last month
clu0 / unet.cu
UNet diffusion model in pure CUDA
☆660 · Jun 28, 2024 · Updated 2 years ago
yale-nlp / InstruSum
☆23 · Feb 26, 2024 · Updated 2 years ago
hkust-nlp / felm
GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65 · Dec 25, 2023 · Updated 2 years ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆964 · Nov 16, 2025 · Updated 7 months ago
ryoungj / ToolEmu
[ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use
☆211 · Mar 22, 2024 · Updated 2 years ago
AI45Lab / CodeAttack
[ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
☆61 · Oct 1, 2025 · Updated 9 months ago
subhashk01 / LLM-addition
LLMs represent numbers on a helix and manipulate that helix to do addition.
☆31 · Feb 4, 2025 · Updated last year
rookie-joe / AutoPSV
☆50 · Oct 28, 2024 · Updated last year
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆224 · Aug 10, 2023 · Updated 2 years ago
abhika-m / FAVA
☆77 · Feb 16, 2024 · Updated 2 years ago
anthropics / hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,847 · Jun 17, 2025 · Updated last year
petezh / OpenD5
Tasks for describing differences between text distributions.
☆17 · Aug 9, 2024 · Updated last year
prateeky2806 / ComPEFT
☆26 · Nov 23, 2023 · Updated 2 years ago