mustafamariam / LLM-Connections-SolverLinks

Code for Columbia University COMS 3997 – LLM Ethics and Foundations

☆14

Alternatives and similar repositories for LLM-Connections-Solver

Users that are interested in LLM-Connections-Solver are comparing it to the libraries listed below

Sorting:

JoshuaPurtell / SmallBench
Small, simple agent task environments for training and evaluation
☆18Updated 7 months ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆87Updated 8 months ago
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆148Updated 4 months ago
joshuacnf / Ctrl-G
☆86Updated 5 months ago
google-deepmind / mishax
☆134Updated 2 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆63Updated 2 months ago
LoryPack / LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆70Updated last year
allenai / infinigram-api
☆61Updated 3 weeks ago
METR / vivaria
Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
☆96Updated this week
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32Updated 2 months ago
haizelabs / sphynx
Sphynx Hallucination Induction
☆54Updated 4 months ago
steering-vectors / steering-vectors
Steering vectors for transformer language models in Pytorch / Huggingface
☆108Updated 4 months ago
emergent-misalignment / emergent-misalignment
☆156Updated 3 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆149Updated 2 months ago
callummcdougall / sae_visualizer
☆28Updated last year
METR / RE-Bench
☆87Updated 2 months ago
xu3kev / BARC
Bootstrapping ARC
☆127Updated 7 months ago
haizelabs / bijection-learning
☆23Updated 8 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆57Updated 9 months ago
SalesforceAIResearch / LaTRO
☆115Updated 4 months ago
ApolloResearch / e2e_sae
Sparse Autoencoder Training Library
☆52Updated last month
TransluceAI / observatory
A toolkit for describing model features and intervening on those features to steer behavior.
☆190Updated 7 months ago
JoshEngels / MultiDimensionalFeatures
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆75Updated 7 months ago
LeonGuertler / TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆184Updated this week
jerber / lang-jepa
☆114Updated 6 months ago
leap-laboratories / PIZZA
An attribution library for LLMs
☆41Updated 9 months ago
EleutherAI / elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆207Updated 2 weeks ago
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆68Updated 3 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆102Updated 2 months ago
amack315 / unsupervised-steering-vectors
☆31Updated last year