Must-read papers and blogs about parametric knowledge mechanism in LLMs.
☆39May 9, 2025Updated last year
Alternatives and similar repositories for Awesome-Parametric-Knowledge-in-LLMs
Users that are interested in Awesome-Parametric-Knowledge-in-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Mar 1, 2025Updated last year
- [ACL2023] Source code for Decouple knowledge from paramters for plug-and-play language modeling☆20Sep 18, 2023Updated 2 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- ☆35Jun 13, 2025Updated 11 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆24May 6, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated last year
- The implementation of ICASSP 2024 lecture presentation paper "Hypergraph Transformer for Semi-Supervised Classification"☆21Nov 23, 2024Updated last year
- [NeurIPS2022] Where to Pay Attention in Sparse Training for Feature Selection?☆12Feb 10, 2023Updated 3 years ago
- ☆22Jul 15, 2024Updated last year
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 11 months ago
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- ☆24May 19, 2026Updated last week
- Unofficial faiss wheel builder for NVIDIA GPU☆36Apr 29, 2026Updated last month
- ☆16Apr 29, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Source code for SWIFT, an efficient reward model.☆21Jan 13, 2026Updated 4 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]☆64Apr 11, 2026Updated last month
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- ☆26Aug 8, 2025Updated 9 months ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆19Oct 18, 2025Updated 7 months ago
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆52Nov 17, 2024Updated last year
- 简单的厦大每日健康自动打卡,每天给您发通知,动动小手即可一劳永逸!☆11Apr 24, 2023Updated 3 years ago
- BetterBench 的面包多小店☆14Feb 20, 2024Updated 2 years ago
- ☆21May 14, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- SePer is an accurate / fast / free-of-API metric to measure document quality via information gain☆31Feb 22, 2026Updated 3 months ago
- [EMNLP 2025 main] C3 Benchmark: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations☆30Dec 24, 2025Updated 5 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 6 months ago
- ☆12Jun 29, 2023Updated 2 years ago
- ☆25Updated this week
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- ☆13Jun 20, 2022Updated 3 years ago
- ☆18Feb 3, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 2024百度商业AI技术创新大赛赛道一:基于大模型的广告检索全国一等奖获奖方案☆16Feb 23, 2025Updated last year
- Some thoughts about writing scientific papers☆22Nov 8, 2024Updated last year
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆17Aug 15, 2025Updated 9 months ago
- ☆14Jun 21, 2024Updated last year
- This repository is the implementation of the model COLA from paper: Graph Contrastive Learning Meets Graph Meta Learning: A Unified Metho…☆12Aug 22, 2024Updated last year
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30Mar 18, 2026Updated 2 months ago
- Training-free LLM-generated Text Detection by Mining Token Probability Sequences (ICLR 2025)☆37Apr 25, 2025Updated last year