LUMIA-Group / PonderingLMView external linksLinks
Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"
☆24Jul 21, 2025Updated 6 months ago
Alternatives and similar repositories for PonderingLM
Users that are interested in PonderingLM are comparing it to the libraries listed below
Sorting:
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆68Dec 15, 2025Updated last month
- Code for Heima☆59Apr 21, 2025Updated 9 months ago
- ☆32Apr 14, 2022Updated 3 years ago
- Self-Questioning Language Models☆57Jan 5, 2026Updated last month
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers☆38Oct 20, 2022Updated 3 years ago
- ☆12Jul 4, 2024Updated last year
- ☆10Nov 15, 2023Updated 2 years ago
- ☆10Jan 28, 2024Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 7 years ago
- A library for handling Structural Causal Models and performing interventional and counterfactual inference on them.☆11Jul 3, 2020Updated 5 years ago
- ☆18Jun 23, 2025Updated 7 months ago
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- FlexiTokens☆19Dec 27, 2025Updated last month
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- ☆13Sep 8, 2024Updated last year
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 8 months ago
- ☆13Jul 8, 2020Updated 5 years ago
- ☆17Dec 23, 2025Updated last month
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆56Jun 11, 2025Updated 8 months ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated last year
- Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference☆12Jun 7, 2025Updated 8 months ago
- PyTorch implementation of paper "Evolving Parameterized Prompt Memory for Continual Learning" in AAAI 2024 (Oral).☆14Apr 15, 2024Updated last year
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆29Jan 18, 2026Updated 3 weeks ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- LLM Beam Search Example Implementation☆13May 3, 2024Updated last year
- ☆12Jun 29, 2024Updated last year
- ☆11Aug 13, 2024Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆21Jan 6, 2026Updated last month
- ☆13Nov 29, 2021Updated 4 years ago
- Implementation for <Understanding Robust Overftting of Adversarial Training and Beyond> in ICML'22.☆12Jul 1, 2022Updated 3 years ago
- LZW压缩算法的完整实现☆10Aug 14, 2014Updated 11 years ago
- ☆24Feb 4, 2026Updated last week
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1☆14Mar 27, 2024Updated last year
- ☆16May 21, 2025Updated 8 months ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆13Jul 2, 2025Updated 7 months ago