[ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
☆47Jul 18, 2025Updated 8 months ago
Alternatives and similar repositories for DynamicKnowledgeCircuits
Users that are interested in DynamicKnowledgeCircuits are comparing it to the libraries listed below
Sorting:
- ☆17Dec 23, 2025Updated 2 months ago
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Jun 13, 2024Updated last year
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆64Jan 26, 2026Updated last month
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆52Mar 5, 2026Updated 2 weeks ago
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"☆13Jun 1, 2024Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12May 24, 2021Updated 4 years ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Oct 23, 2024Updated last year
- ☆15Nov 9, 2025Updated 4 months ago
- ☆19Mar 10, 2025Updated last year
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆19Nov 17, 2025Updated 4 months ago
- ☆15Mar 11, 2025Updated last year
- TLS & API keys for your LLM APIs☆20Dec 17, 2025Updated 3 months ago
- ☆14Oct 24, 2024Updated last year
- Ship a prebuilt Wine environment driven by box86 & box64 on Ubuntu Touch☆12Oct 10, 2024Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆165Nov 14, 2025Updated 4 months ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 5 months ago
- Code for "Automatic Circuit Finding and Faithfulness"☆17Jul 11, 2024Updated last year
- [ACL 2025] Knowledge Unlearning for Large Language Models☆49Sep 18, 2025Updated 6 months ago
- Exploring Model Kinship for Merging Large Language Models☆28Apr 16, 2025Updated 11 months ago
- Port of GGML to C#☆13Jul 1, 2023Updated 2 years ago
- ☆21Dec 5, 2024Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 5 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆56Apr 15, 2024Updated last year
- Library that provides metrics to assess representation quality☆24Feb 5, 2025Updated last year
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆24Mar 6, 2026Updated 2 weeks ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- The Bootstrap UI for WPF.☆14Apr 7, 2025Updated 11 months ago
- ☆25Feb 20, 2026Updated last month
- ☆15Apr 26, 2025Updated 10 months ago
- A tool to automatically detect copy+pasted and vendored code between repositories☆75Mar 12, 2026Updated last week
- ☆25Feb 27, 2023Updated 3 years ago
- ☆14Oct 11, 2023Updated 2 years ago
- Benchmarking Optimizers for LLM Pretraining☆56Dec 30, 2025Updated 2 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆81Oct 29, 2025Updated 4 months ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Aug 25, 2024Updated last year
- AI-Powered Application of Make-up on Photos☆21Aug 30, 2021Updated 4 years ago
- ☆10Jun 12, 2019Updated 6 years ago
- ☆13Mar 10, 2025Updated last year