srush / awesome-o1
A bibliography and survey of the papers surrounding o1
☆643 · Updated this week
Related projects
Alternatives and complementary repositories for awesome-o1
- RewardBench: the first evaluation tool for reward models. ☆426 · Updated 2 weeks ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆739 · Updated last week
- This repository collects all relevant resources about interpretability in LLMs ☆283 · Updated last week
- System 2 Reasoning Link Collection ☆686 · Updated 2 weeks ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆644 · Updated last month
- What would you do with 1000 H100s... ☆895 · Updated 10 months ago
- Minimalistic large language model 3D-parallelism training ☆1,229 · Updated last week
- Building blocks for foundation models. ☆388 · Updated 10 months ago
- ☆1,263 · Updated this week
- Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions ☆633 · Updated this week
- Annotated version of the Mamba paper ☆455 · Updated 8 months ago
- A library for advanced large language model reasoning ☆1,420 · Updated 2 months ago
- ☆920 · Updated this week
- Training Sparse Autoencoders on Language Models ☆448 · Updated this week
- ☆320 · Updated 3 months ago
- Scalable toolkit for efficient model alignment ☆614 · Updated this week
- Representation Engineering: A Top-Down Approach to AI Transparency ☆721 · Updated 2 months ago
- Sparse autoencoders ☆336 · Updated 3 weeks ago
- Transformers with Arbitrarily Large Context ☆637 · Updated 3 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆711 · Updated last month
- Website for hosting the Open Foundation Models Cheat Sheet. ☆255 · Updated 4 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆435 · Updated 7 months ago
- Official repository for ORPO ☆419 · Updated 5 months ago
- GPT4 based personalized ArXiv paper assistant bot ☆487 · Updated 7 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆704 · Updated last week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆788 · Updated this week
- A repository for research on medium sized language models. ☆479 · Updated last week
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning ☆553 · Updated 8 months ago
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" ☆801 · Updated 2 months ago