mustafamariam / LLM-Connections-SolverLinks
Code for Columbia University COMS 3997 – LLM Ethics and Foundations
☆14Updated 6 months ago
Alternatives and similar repositories for LLM-Connections-Solver
Users that are interested in LLM-Connections-Solver are comparing it to the libraries listed below
Sorting:
- Inference-time scaling for LLMs-as-a-judge.☆251Updated this week
- Small, simple agent task environments for training and evaluation☆18Updated 8 months ago
- ☆99Updated 4 months ago
- A toolkit for describing model features and intervening on those features to steer behavior.☆193Updated 8 months ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 9 months ago
- Open source interpretability artefacts for R1.☆154Updated 2 months ago
- ☆134Updated 3 months ago
- ☆92Updated 2 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆99Updated this week
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆149Updated 5 months ago
- Draw more samples☆192Updated last year
- Plotting (entropy, varentropy) for small LMs☆97Updated 2 months ago
- Sphynx Hallucination Induction☆53Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 11 months ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Updated 10 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 6 months ago
- ☆171Updated 4 months ago
- ☆116Updated 6 months ago
- METR Task Standard☆154Updated 5 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 8 months ago
- ☆24Updated 8 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆240Updated 5 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 5 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆173Updated 4 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆177Updated this week
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆196Updated this week
- ☆70Updated this week
- The history files when recording human interaction while solving ARC tasks☆113Updated this week
- ⚖️ Awesome LLM Judges ⚖️☆107Updated 2 months ago