lil-lab / lm-classLinks
Materials for a language modeling class, broadly construed
☆31Updated this week
Alternatives and similar repositories for lm-class
Users that are interested in lm-class are comparing it to the libraries listed below
Sorting:
- ☆92Updated 3 weeks ago
- Open Implementations of LLM Analyses☆107Updated last year
- ☆63Updated last year
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆62Updated 9 months ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- Commit0: Library Generation from Scratch☆175Updated 8 months ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆283Updated 3 weeks ago
- Official repo for Learning to Reason for Long-Form Story Generation☆73Updated 8 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆118Updated last month
- Composable inference algorithms with LLMs and programmable logic☆69Updated last year
- ☆42Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆47Updated last year
- Open source interpretability artefacts for R1.☆167Updated 8 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- Code for the paper "Fishing for Magikarp"☆178Updated 8 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆191Updated this week
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆234Updated 5 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆152Updated 11 months ago
- Fluid Language Model Benchmarking☆25Updated 4 months ago
- ☆151Updated 4 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆126Updated 3 months ago
- Evaluating LLMs with CommonGen-Lite☆93Updated last year
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 5 months ago
- Evaluation of LLMs on latest math competitions☆212Updated 3 weeks ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 4 months ago
- ☆261Updated 9 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated last year
- A toolkit for describing model features and intervening on those features to steer behavior.☆225Updated last month
- Evaluating LLMs with fewer examples☆169Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year