chr26195/PENCIL

chr26195 / PENCIL

This is the official implementation for the paper "PENCIL: Long Thoughts with Short Memory".

☆81

Alternatives and similar repositories for PENCIL

Users that are interested in PENCIL are comparing it to the libraries listed below.

wellecks / llemma_formal2formal
Llemma formal2formal (tactic prediction) theorem proving experiments
☆20 · Oct 17, 2023 · Updated 2 years ago
bbartoldson / TBA
Official implementation of TBA for async LLM post-training.
☆32 · Nov 5, 2025 · Updated 8 months ago
zhaoxlpku / SubgoalXL
☆26 · Aug 23, 2024 · Updated last year
jidiai / Competition_AAMAS2023
Source code for AAMAS 2023 Imperfect-information Card Game Competition
☆13 · Mar 21, 2024 · Updated 2 years ago
rosieyzh / openrlhf-pretrain
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29 · Oct 14, 2025 · Updated 9 months ago
chr26195 / AP-MDM
This is the official implementation for the paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".
☆23 · Nov 17, 2025 · Updated 8 months ago
uservan / speculative_thinking
☆34 · Oct 13, 2025 · Updated 9 months ago
gouki510 / Topology_of_Reasoning
☆42 · Jun 11, 2025 · Updated last year
vyomakesh09 / longagent
LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration
☆11 · Mar 11, 2024 · Updated 2 years ago
cadentj / caft
☆25 · Mar 30, 2026 · Updated 3 months ago
hkust-nlp / RL-Verifier-Robustness
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24 · Oct 7, 2025 · Updated 9 months ago
chuanyang-Zheng / Lyra-theorem-prover
This is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"
☆15 · Jul 2, 2024 · Updated 2 years ago
liushulinle / UloRL
An Ultra-Long Output Reinforcement Learning Approach
☆23 · Jul 31, 2025 · Updated 11 months ago
AngelaZZZ-611 / reasoning_models_probing
☆22 · May 14, 2026 · Updated 2 months ago
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆50 · Sep 23, 2025 · Updated 10 months ago
quint-t / Puzzle-Generator-and-Solver
Puzzle Generator; Einstein's Riddle, Zebra Puzzle and Blood Donation Puzzle Solver. For non-commercial use only!
☆20 · Mar 4, 2023 · Updated 3 years ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆164 · Jul 8, 2025 · Updated last year
YuxiangChai / A3
☆35 · Jan 12, 2026 · Updated 6 months ago
rioyokotalab / swallow-code-math
Ongoing research project for code & math LLMs
☆32 · Jul 4, 2025 · Updated last year
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated last year
sjelassi / transformers_ssm_copy
☆40 · Feb 26, 2024 · Updated 2 years ago
Eleanor-H / MUSTARD
Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
☆43 · May 29, 2024 · Updated 2 years ago
lee-ny / teaching_arithmetic
☆84 · Aug 31, 2023 · Updated 2 years ago
aadityasingh / icl-dynamics
☆26 · Feb 20, 2026 · Updated 5 months ago
YihongDong / RL-PLUS
☆27 · Aug 31, 2025 · Updated 10 months ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆92 · Nov 18, 2025 · Updated 8 months ago
Simplified-Reasoning / LUFFY
Official Repository of "Learning to Reason under Off-Policy Guidance"
☆461 · Mar 20, 2026 · Updated 4 months ago
THU-KEG / PairJudgeRM
☆15 · Apr 14, 2025 · Updated last year
OpenNLPLab / lightning-attention
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
☆344 · Feb 23, 2025 · Updated last year
MasterVito / SvS
Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training
☆54 · Dec 13, 2025 · Updated 7 months ago
cdxeve / awesome-computer-use-agents
A curated list of papers, tools, and benchmarks on LLM-based computer-use agents, covering both terminal/CLI and GUI approaches.
☆16 · May 21, 2026 · Updated 2 months ago
eqimp / hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆142 · Aug 13, 2025 · Updated 11 months ago
BryceZhuo / HybridNorm
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19 · Mar 7, 2025 · Updated last year
allenai / signal-and-noise
Measuring the Signal to Noise Ratio in Language Model Evaluation
☆31 · Aug 19, 2025 · Updated 11 months ago
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆105 · Dec 6, 2024 · Updated last year
Adversarr / LearningSparsePreconditioner4GPU
Official implementation of "Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs" (NeurIPS 2025)
☆19 · Nov 3, 2025 · Updated 8 months ago
SalesforceAIResearch / swecomm
☆28 · Jun 2, 2026 · Updated last month
WtaoZhao / GraphGLOW
PyTorch implementation of GraphGLOW: Universal and Generalizable Structure Learning for Graph Neural Networks
☆36 · Jun 28, 2023 · Updated 3 years ago
yanzhh / HGERE
Source Code for "Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks"
☆33 · May 29, 2026 · Updated 2 months ago