Trae1ounG/Awesome-Parametric-Knowledge-in-LLMs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Trae1ounG/Awesome-Parametric-Knowledge-in-LLMs)

Trae1ounG / Awesome-Parametric-Knowledge-in-LLMs

Must-read papers and blogs about parametric knowledge mechanism in LLMs.

☆41

Alternatives and similar repositories for Awesome-Parametric-Knowledge-in-LLMs

Users that are interested in Awesome-Parametric-Knowledge-in-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yxyang1111 / Pseudo-Knowledge-Graph
View on GitHub
☆12Mar 1, 2025Updated last year
Hannibal046 / PlugLM
View on GitHub
[ACL2023] Source code for Decouple knowledge from paramters for plug-and-play language modeling
☆20Sep 18, 2023Updated 2 years ago
GSYfate / knnlm-limits
View on GitHub
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆24Apr 30, 2025Updated last year
oashua / MathAgent
View on GitHub
Code repo for MathAgent
☆20Dec 15, 2023Updated 2 years ago
T-Lab-CUHKSZ / G2RPO-A
View on GitHub
[ACL 2026] G2RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance
☆16May 20, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Trae1ounG / BuPO
View on GitHub
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60Feb 6, 2026Updated 5 months ago
Trae1ounG / DyPRAG
View on GitHub
[arxiv: 2503.23895] Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement
☆182Aug 14, 2025Updated 11 months ago
Sphere-AI-Lab / OrthoMerge
View on GitHub
Implementation of <Orthogonal Model Merging>
☆33May 27, 2026Updated 2 months ago
zepingyu0512 / awesome-LLM-neuron
View on GitHub
☆36Jun 13, 2025Updated last year
shawntan / stickbreaking-attention
View on GitHub
Stick-breaking attention
☆63Jul 1, 2025Updated last year
NEUIR / P-ALIGN
View on GitHub
[ACL '26] source code for the paper: "Long-Chain Reasoning Distillation via Adaptive Prefix Alignment"
☆16Jan 21, 2026Updated 6 months ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
zzp1012 / SAM-in-Late-Phase
View on GitHub
[ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"
☆19Feb 20, 2025Updated last year
GhadaSokar / WAST
View on GitHub
[NeurIPS2022] Where to Pay Attention in Sparse Training for Feature Selection?
☆12Feb 10, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JRC1995 / Continuous-RvNN
View on GitHub
Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021)
☆12Aug 18, 2021Updated 4 years ago
ozyyshr / RAST
View on GitHub
Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)
☆22Oct 16, 2025Updated 9 months ago
StarDewXXX / AdaR1
View on GitHub
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24May 6, 2026Updated 2 months ago
qzp2018 / UniECS
View on GitHub
Official implement of CIKM2025: 《UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion》
☆21Sep 17, 2025Updated 10 months ago
zeroxleo / HyperGT
View on GitHub
The implementation of ICASSP 2024 lecture presentation paper "Hypergraph Transformer for Semi-Supervised Classification"
☆21Nov 23, 2024Updated last year
Ginjing-Yuan / QWen2-from_ground_up
View on GitHub
☆22Jul 15, 2024Updated 2 years ago
liuzhao09 / DiffGRM
View on GitHub
☆26Sep 25, 2025Updated 10 months ago
hhan1018 / NesTools
View on GitHub
[COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models
☆18Jan 18, 2025Updated last year
Small-Model-Gap / Small-Model-Learnability-Gap
View on GitHub
☆23Oct 10, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
minglllli / CLS-RL
View on GitHub
[NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning
☆90Sep 19, 2025Updated 10 months ago
Yunhao-Feng / BackdoorAgent
View on GitHub
BackdoorAgent is a stage-aware framework and benchmark that instruments LLM-agent workflows (planning, memory, tools) to systematically i…
☆43Mar 16, 2026Updated 4 months ago
TIGER-AI-Lab / TheoremQA
View on GitHub
The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)
☆40May 15, 2024Updated 2 years ago
meituan / MemOCR
View on GitHub
MemOCR: an OCR-driven visual memory agent.
☆33May 17, 2026Updated 2 months ago
InternScience / MME-Reasoning
View on GitHub
Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs
☆45Jun 17, 2025Updated last year
PKU-TANGENT / ConFiguRe
View on GitHub
Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"
☆12Jul 27, 2023Updated 3 years ago
Commander-bao / XMU_Daily_Health_Report
View on GitHub
简单的厦大每日健康自动打卡，每天给您发通知，动动小手即可一劳永逸！
☆11Apr 24, 2023Updated 3 years ago
sustcsonglin / second-order-neural-dmv
View on GitHub
source code of COLING2020 "Second-Order Unsupervised Neural Dependency Parsing"
☆16Oct 24, 2022Updated 3 years ago
whyNLP / Probabilistic-Transformer
View on GitHub
A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.
☆26Oct 22, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
OpenBMB / ParamMute
View on GitHub
ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation
☆61Jul 7, 2026Updated 3 weeks ago
xmcp / PKU_EECS_UGR_THSS_2022
View on GitHub
Latex Template for Undergraduate Thesis at School of EECS, Peking University
☆42Jun 3, 2022Updated 4 years ago
TongDog / ultralytics-Ascend
View on GitHub
Ultralytics YOLO with Huawei Ascend 910B NPU support.
☆18Jan 26, 2026Updated 6 months ago
frankaging / Causal-Distill
View on GitHub
The Codebase for Causal Distillation for Language Models (NAACL '22)
☆26May 1, 2022Updated 4 years ago
runchu-tian / LongPiBench
View on GitHub
The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆14Dec 16, 2024Updated last year
XuandongZhao / Ginsew
View on GitHub
[ICML 2023] Protecting Language Generation Models via Invisible Watermarking
☆13Sep 8, 2023Updated 2 years ago
tmlr-group / CoPA
View on GitHub
[NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"
☆11Nov 15, 2024Updated last year