Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"
☆25Jul 21, 2025Updated 7 months ago
Alternatives and similar repositories for PonderingLM
Users that are interested in PonderingLM are comparing it to the libraries listed below
Sorting:
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆70Dec 15, 2025Updated 2 months ago
- Code for Heima☆59Apr 21, 2025Updated 10 months ago
- ☆32Apr 14, 2022Updated 3 years ago
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆22Feb 9, 2026Updated 3 weeks ago
- Self-Questioning Language Models☆57Jan 5, 2026Updated 2 months ago
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers☆38Oct 20, 2022Updated 3 years ago
- ☆12Jul 4, 2024Updated last year
- ☆10Nov 15, 2023Updated 2 years ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆18Feb 14, 2026Updated 2 weeks ago
- A library for handling Structural Causal Models and performing interventional and counterfactual inference on them.☆13Jul 3, 2020Updated 5 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 9 months ago
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- ☆17Dec 23, 2025Updated 2 months ago
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- ☆18Jun 23, 2025Updated 8 months ago
- ☆10Jan 28, 2024Updated 2 years ago
- FlexiTokens☆18Dec 27, 2025Updated 2 months ago
- ☆13Sep 8, 2024Updated last year
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 7 years ago
- ☆13Jul 8, 2020Updated 5 years ago
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆56Jun 11, 2025Updated 8 months ago
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1☆14Mar 27, 2024Updated last year
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated last year
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- ☆13Jul 2, 2025Updated 8 months ago
- LZW压缩算法的完整实现☆10Aug 14, 2014Updated 11 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- ☆12Jun 29, 2024Updated last year
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- LLM Beam Search Example Implementation☆13May 3, 2024Updated last year
- Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference☆13Jun 7, 2025Updated 8 months ago
- ☆13Nov 29, 2021Updated 4 years ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated last year
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆17May 21, 2025Updated 9 months ago
- ☆11Sep 25, 2020Updated 5 years ago
- PyTorch implementation of paper "Evolving Parameterized Prompt Memory for Continual Learning" in AAAI 2024 (Oral).☆14Apr 15, 2024Updated last year