okhat / blog
☆233 Updated last month
Related projects
Alternatives and complementary repositories for blog
- A bibliography and survey of the papers surrounding o1 ☆754 Updated this week
- ☆328 Updated 4 months ago
- Open-source code for the paper: Retrieval Head Mechanistically Explains Long-Context Factuality ☆160 Updated 3 months ago
- This repository collects all relevant resources about interpretability in LLMs ☆288 Updated 2 weeks ago
- Representation Engineering: A Top-Down Approach to AI Transparency ☆728 Updated 3 months ago
- A curated list of LLM interpretability-related material: tutorials, libraries, surveys, papers, blogs, etc. ☆174 Updated last month
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆181 Updated this week
- RewardBench: the first evaluation tool for reward models. ☆431 Updated 3 weeks ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666. ☆217 Updated this week
- Extracting spatial and temporal world models from LLMs ☆244 Updated last year
- GPT4 based personalized ArXiv paper assistant bot ☆488 Updated 7 months ago
- ☆247 Updated 5 months ago
- Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments). ☆242 Updated 7 months ago
- System 2 Reasoning Link Collection ☆693 Updated 3 weeks ago
- large population models ☆214 Updated 3 weeks ago
- Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions ☆641 Updated 2 weeks ago
- Sparse autoencoders ☆342 Updated last week
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆307 Updated 7 months ago
- ☆72 Updated 4 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆744 Updated this week
- ☆190 Updated 3 months ago
- Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping". ☆322 Updated 5 months ago
- Some preliminary explorations of Mamba's context scaling. ☆191 Updated 9 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆448 Updated 8 months ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models" ☆428 Updated 6 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight) ☆164 Updated this week
- Code and example data for the paper: Rule Based Rewards for Language Model Safety ☆158 Updated 4 months ago
- A Survey on Data Selection for Language Models ☆182 Updated last month
- OLMoE: Open Mixture-of-Experts Language Models ☆460 Updated this week