JacobPfau / fillerTokens
☆60Updated last year
Alternatives and similar repositories for fillerTokens
Users that are interested in fillerTokens are comparing it to the libraries listed below
Sorting:
- Replicating O1 inference-time scaling laws☆85Updated 5 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆47Updated last year
- ☆31Updated 4 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year
- ☆78Updated 8 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆29Updated last month
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆95Updated last week
- Language models scale reliably with over-training and on downstream tasks☆97Updated last year
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆49Updated 5 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- ☆171Updated 3 weeks ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆27Updated 7 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆101Updated 2 months ago
- ☆46Updated 2 months ago
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆59Updated last year
- Exploration of automated dataset selection approaches at large scales.☆40Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆108Updated this week
- ☆25Updated 4 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆134Updated 7 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆83Updated last year
- Codebase for Instruction Following without Instruction Tuning☆34Updated 7 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆82Updated last week
- Long Context Extension and Generalization in LLMs☆55Updated 7 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆84Updated 7 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆79Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- ☆114Updated 2 months ago
- ☆97Updated 10 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆75Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆43Updated 5 months ago