okhat / blog
☆276Updated 6 months ago
Alternatives and similar repositories for blog:
Users who are interested in blog are comparing it to the repositories listed below:
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆304Updated 5 months ago
- A brief and partial summary of RLHF algorithms.☆127Updated last month
- Paper list for Efficient Reasoning.☆403Updated this week
- A curated list of LLM interpretability-related material: tutorials, libraries, surveys, papers, blogs, etc.☆227Updated last month
- A platform for developers to simulate collaborative research activities☆146Updated this week
- A Survey on Efficient Reasoning for LLMs☆332Updated this week
- Paper List of Inference/Test Time Scaling/Computing☆195Updated last week
- A bibliography and survey of the papers surrounding o1☆1,187Updated 5 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆144Updated last month
- GPT4 based personalized ArXiv paper assistant bot☆516Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective☆882Updated last week
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆419Updated 2 weeks ago
- [ICML 2024] CLLMs: Consistency Large Language Models☆390Updated 5 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆162Updated last week
- ☆144Updated 5 months ago
- ☆453Updated 9 months ago
- This repository collects all relevant resources about interpretability in LLMs☆341Updated 5 months ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆185Updated 8 months ago
- ☆166Updated last week
- AnchorAttention: Improved attention for LLMs long-context training☆206Updated 3 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆331Updated last week
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆128Updated 3 months ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"☆182Updated 2 weeks ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond