LUMIA-Group/PonderingLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LUMIA-Group/PonderingLM)

LUMIA-Group / PonderingLM

Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"

☆26

Alternatives and similar repositories for PonderingLM

Users that are interested in PonderingLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HLTCHKUST / UniVaR
View on GitHub
Official reposity for paper "High-Dimension Human Value Representation in Large Language Models" (NAACL'25 Main)
☆23Jul 9, 2024Updated 2 years ago
EvanZhuang / mixinputs
View on GitHub
Official implementation for Text Generation Beyond Discrete Token Sampling
☆26Aug 11, 2025Updated 11 months ago
MingyuJ666 / Disentangling-Memory-and-Reasoning
View on GitHub
[ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.
☆87Nov 2, 2025Updated 8 months ago
zz1358m / ATP-Latent-master
View on GitHub
☆17Feb 4, 2026Updated 5 months ago
DJC-GO-SOLO / Latent-SFT
View on GitHub
Official implementation of Latent-SFT: teaching LLMs to reason with vocabulary-space latent chains.
☆55May 18, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
shawnricecake / Heima
View on GitHub
[ICML 2026] Heima
☆75May 20, 2026Updated 2 months ago
zhenyi4 / codi
View on GitHub
Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"
☆102Dec 15, 2025Updated 7 months ago
LUMIA-Group / ConceptLM
View on GitHub
Official Implementation of ConceptLM.
☆23Mar 18, 2026Updated 4 months ago
nasosger / MuToR
View on GitHub
[NeurIPS '25] Multi-Token Prediction Needs Registers
☆30Dec 14, 2025Updated 7 months ago
xiaomi-research / colar
View on GitHub
[NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
☆97Jun 29, 2026Updated 3 weeks ago
jins7 / LatentEvolve
View on GitHub
☆27Oct 9, 2025Updated 9 months ago
mcleish7 / retrofitting-recurrence
View on GitHub
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
☆68Nov 11, 2025Updated 8 months ago
ixaxaar / pytorch-dni
View on GitHub
Decoupled Neural Interfaces Using Synthetic Gradients - under develeopment
☆11Jun 27, 2025Updated last year
ernoult / targetProp
View on GitHub
Testing Difference Target Propagation (DTP) on MNIST.
☆13Oct 12, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
llmsresearch / scone
View on GitHub
Implementation and evaluation of Scaling Embedding Layers in Language Models research paper
☆15Feb 2, 2026Updated 5 months ago
xuyige / SoftCoT
View on GitHub
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆92May 30, 2025Updated last year
InternLM / SIM-CoT
View on GitHub
[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"
☆212Apr 13, 2026Updated 3 months ago
jxiw / MambaByte
View on GitHub
[CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model
☆27Oct 12, 2024Updated last year
thu-nics / TaH
View on GitHub
[ICML'26] Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"
☆75Jul 17, 2026Updated last week
UCSB-AI / Soft-Thinking
View on GitHub
Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
☆345Jun 12, 2026Updated last month
Zanette-Labs / efficient-reasoning
View on GitHub
☆75Apr 13, 2025Updated last year
zhaoxlpku / PromptCoT
View on GitHub
☆17Apr 10, 2025Updated last year
YihongDong / FANformer
View on GitHub
☆39Mar 25, 2026Updated 4 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Kaffaljidhmah2 / SpecDec_pp
View on GitHub
Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
☆19Jul 10, 2025Updated last year
lyh983012 / SNN-genunit
View on GitHub
developing tools for LIAF-SNNs and LIF-SNNs
☆10Sep 14, 2022Updated 3 years ago
Alsace08 / Chain-of-Embedding
View on GitHub
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
☆101Dec 19, 2024Updated last year
D2I-ai / dasd-thinking
View on GitHub
☆105Jan 27, 2026Updated 5 months ago
Mixture-AI / Mixture-of-Depths
View on GitHub
Google DeepMind: Mixture of Depths Unofficial Implementation.
☆12May 29, 2024Updated 2 years ago
ernoult / scalingDTP
View on GitHub
"Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)
☆13Jan 17, 2023Updated 3 years ago
denkle / HDC-VSA_cookbook_tutorial
View on GitHub
This codes presents examples of constructing primitives for data structures with Hyperdimensional Computing/Vector Symbolic Architectures
☆17Jun 4, 2021Updated 5 years ago
shawntan / SUT
View on GitHub
Repository for Sparse Universal Transformers
☆20Oct 23, 2023Updated 2 years ago
LUMIA-Group / MemoryDecoder
View on GitHub
The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Pos…
☆75Sep 29, 2025Updated 9 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
YuvrajSingh-mist / SmolLlama
View on GitHub
So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…
☆18Mar 26, 2025Updated last year
alessiodevoto / l2compress
View on GitHub
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆19Dec 13, 2024Updated last year
suoych / KEDs
View on GitHub
Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)
☆20Nov 4, 2024Updated last year
huskydoge / Awesome-Loop-Models
View on GitHub
A curated list of papers and selected technical blogs on Loop Models.
☆225Updated this week
BriansIDP / video-SALMONN-o1
View on GitHub
☆40Aug 26, 2025Updated 10 months ago
sail-sg / SimLayerKV
View on GitHub
The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
☆54Oct 18, 2024Updated last year
LUMIA-Group / MLPMemory
View on GitHub
The official implementation of the paper "MLP Memory: A Retriever-Pretrained Memory for Large Language Models". (ICLR 2026)
☆68Jun 11, 2026Updated last month