[ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
☆47 · Jul 18, 2025 · Updated 8 months ago
Alternatives and similar repositories for DynamicKnowledgeCircuits
Users that are interested in DynamicKnowledgeCircuits are comparing it to the libraries listed below.
- Chat client for LLMs. ☆15 · Jul 23, 2024 · Updated last year
- ☆17 · Dec 23, 2025 · Updated 3 months ago
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models ☆22 · Jun 13, 2024 · Updated last year
- ☆17 · Apr 9, 2025 · Updated last year
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models" ☆13 · Jun 1, 2024 · Updated last year
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing ☆23 · Mar 28, 2026 · Updated last week
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation ☆12 · May 24, 2021 · Updated 4 years ago
- The official implementation of Cross-Task Experience Sharing (COPS) ☆29 · Oct 23, 2024 · Updated last year
- ☆19 · Mar 10, 2025 · Updated last year
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes. ☆22 · Feb 19, 2026 · Updated last month
- [NeurIPS24] "What makes unlearning hard and what to do about it" [NeurIPS24] "Scalability of memorization-based machine unlearning" ☆21 · May 24, 2025 · Updated 10 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆126 · Aug 7, 2025 · Updated 8 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models ☆17 · Jun 29, 2023 · Updated 2 years ago
- Evolving LangChain agent architectures using the Quality-Diversity (QD) algorithm. ☆17 · Aug 29, 2025 · Updated 7 months ago
- Ship a prebuilt Wine environment driven by box86 & box64 on Ubuntu Touch ☆12 · Oct 10, 2024 · Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆165 · Nov 14, 2025 · Updated 4 months ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework ☆11 · Oct 10, 2025 · Updated 6 months ago
- Code for "Automatic Circuit Finding and Faithfulness" ☆17 · Jul 11, 2024 · Updated last year
- [ACL 2025] Knowledge Unlearning for Large Language Models ☆49 · Sep 18, 2025 · Updated 6 months ago
- Exploring Model Kinship for Merging Large Language Models ☆28 · Apr 16, 2025 · Updated 11 months ago
- Port of GGML to C# ☆13 · Jul 1, 2023 · Updated 2 years ago
- AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing ☆44 · Sep 17, 2024 · Updated last year
- RWKV6 in native pytorch and triton:) ☆11 · Aug 4, 2024 · Updated last year
- ☆10 · Jan 28, 2024 · Updated 2 years ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆56 · Apr 15, 2024 · Updated last year
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization ☆24 · Mar 6, 2026 · Updated last month
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space" ☆25 · Jul 21, 2025 · Updated 8 months ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network" ☆27 · Jun 4, 2024 · Updated last year
- Library that provides metrics to assess representation quality ☆27 · Feb 5, 2025 · Updated last year
- ☆26 · Feb 20, 2026 · Updated last month
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size ☆19 · May 19, 2019 · Updated 6 years ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆55 · Sep 29, 2025 · Updated 6 months ago
- ☆25 · Feb 27, 2023 · Updated 3 years ago
- Benchmarking Optimizers for LLM Pretraining ☆57 · Dec 30, 2025 · Updated 3 months ago
- ☆14 · Oct 11, 2023 · Updated 2 years ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO ☆81 · Oct 29, 2025 · Updated 5 months ago
- A tool to automatically detect copy+pasted and vendored code between repositories ☆76 · Apr 1, 2026 · Updated last week
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023). ☆26 · Aug 25, 2024 · Updated last year