JacobPfau / fillerTokensLinks

☆67

Alternatives and similar repositories for fillerTokens

Users that are interested in fillerTokens are comparing it to the libraries listed below

Sorting:

hughbzhang / o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
☆89Updated 8 months ago
da03 / Internalize_CoT_Step_by_Step
☆187Updated 3 months ago
SalesforceAIResearch / LaTRO
☆118Updated 5 months ago
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆57Updated last year
princeton-nlp / USACO
Can Language Models Solve Olympiad Programming?
☆118Updated 6 months ago
architsharma97 / dpo-rlaif
☆99Updated last year
ScalingIntelligence / large_language_monkeys
☆101Updated 10 months ago
LoryPack / LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆72Updated last year
THUDM / T1
RL Scaling and Test-Time Scaling (ICML'25)
☆109Updated 6 months ago
ucl-dark / llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
☆113Updated last year
katiekang1998 / reasoning_generalization
☆34Updated 7 months ago
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆104Updated 2 weeks ago
JoshEngels / MultiDimensionalFeatures
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆77Updated 8 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
imagination-research / lbt
[NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study
☆52Updated 8 months ago
RobertCsordas / moeut
☆83Updated 11 months ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97Updated last year
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆150Updated 6 months ago
SynthLabsAI / big-math
A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
☆59Updated 5 months ago
da03 / implicit_chain_of_thought
☆135Updated 8 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆33Updated 2 weeks ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆47Updated 5 months ago
kyegomez / Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
☆112Updated 2 weeks ago
google-deepmind / bbeh
☆85Updated 3 months ago
huggingface / ioi
☆38Updated 4 months ago
stanfordnlp / axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆112Updated last month
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88Updated 10 months ago
sail-sg / VeriFree
Reinforcing General Reasoning without Verifiers
☆78Updated last month
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆160Updated last year
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆25Updated 8 months ago