okhat / blog
☆262Updated 3 months ago
Alternatives and similar repositories for blog:
Users that are interested in blog are comparing it to the libraries listed below
- A bibliography and survey of the papers surrounding o1☆1,042Updated 2 months ago
- This repository collects all relevant resources about interpretability in LLMs☆305Updated 2 months ago
- Sparse autoencoders☆407Updated this week
- GPT4 based personalized ArXiv paper assistant bot☆500Updated 9 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆197Updated 3 months ago
- A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning mate…☆211Updated 4 months ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆170Updated 5 months ago
- ☆404Updated 5 months ago
- [ICML 2024] CLLMs: Consistency Large Language Models☆368Updated 2 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆278Updated last month
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆286Updated this week
- A Telegram bot to recommend arXiv papers☆219Updated last week
- ☆104Updated last month
- A brief and partial summary of RLHF algorithms.☆89Updated last month
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆120Updated last month
- AnchorAttention: Improved attention for LLMs long-context training☆202Updated this week
- ☆199Updated 3 weeks ago
- For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.☆82Updated this week
- Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).☆270Updated 9 months ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆775Updated 5 months ago
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆97Updated 2 weeks ago
- System 2 Reasoning Link Collection☆722Updated this week
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆175Updated last month
- Training Sparse Autoencoders on Language Models☆573Updated this week
- ☆135Updated this week
- RewardBench: the first evaluation tool for reward models.☆491Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space☆388Updated this week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆202Updated this week
- Building a comprehensive and handy list of papers for GUI agents☆163Updated last week